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This invention is a novel method for detecting and localizing 
specific nucleic acid sequences in a sample with a high degree of 
sensitivity and specificity. Hie method and novel compositions used 
in the method involve the use of Probe Nucleic Acids, the production of 
nucleic acid binding regions and the use of nucleic acid Target Binding 
Assemblies to detect and localize specific Target Nucleic Acids. The 
detection and localization of the Target Nucleic Add is accomplished 
even in the presence of nucleic acids which have similar sequences. 
The method provides for a high degree of amplification of the signal 
produced by each specific binding event In particular, methods and 
compositions are presented for the detection of HIV and HPV nucleic 
acid in samples. These methods and compositions find use in dia gtyy riff 
of disease, genetic monitoring, fareosics, and analysis of nucleic add 
mixtures. Some of the novel compositions used in the detection method 
are useful in preventing or treating pathogenic conditions. 
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DESCRIPTION 

MgTHQD QF DET CTQN QF NUCLEIC ACIDS 
WITH A SPECIFIC SEQUENCE COMPOSITION 

5 

Pttkgrcmul Qf ftp fovffltipp 

LFwIdpfteftry^wm 

This invention provides a method and compositions for use in binding, detecting, and 
amplifying the detection of specific Ttrget Nucleic Acid sequences in a sample with fidelity and 
10 accuracy, even in the presence of closely related but different nucleic acids . The binding may involve 
the chaperoning and assembly of specific molecules into Target Binding Assemblies which 
specifically bind Target Binding Regions formed by the hybridization of Probe Nucleic Acids and 
Target Nucleic Acid sequences. The amplifying may involve the chaperoning and/or assembly of 
specific molecules into Booster Binding Assemblies which specifically bind Booster Binding 

15 Regions formed by the hybridization of Booster Nucleic Acids with Probe Nucleic Adds, Target 
Nucleic Acids, or other Booster Nucleic Acids. A method, and compositions, involving Hairpin 
Nucleic Adds is also provided to enable control of the size of specifically or non-specifically 
elongated Booster Nucleic Acids and Booster Binding Assemblies used in the amplification. The 
detecting involves providing one or more detectable labels, including radioactive, light- or 

20 fluoresceaucmitting, enzymatic, or other detectable or signal-generating molecules, in association 
with die Probe Nucleic Acid, die Target Binding Assembly, die Booster Nucleic Acid, the Booster 
Binding Assembly, or the Hairpin Nucleic Acid A method is presented for isolating nucleic acid 
fragments from an organism which has TBA component binding sites in order to create a probe 
nucleic acid and a TBA which is unique for that fragment and/or organism. Therapeutic and 

25 prophylactic uses of die Target Binding Assemblies and compositions for such use are also provided. 



2. Backpro i'nH n ^scription of Related Art 
30 There are an increasing number of cases in which it is important to be able to detect nucleic 

acids containing a specific sequence, hereinafter named Target Nucleic Acids (TNAs), in a sample. 
It is desirable to be able to detect the TNAs with the smallest number f processing steps, with the 
simplest components and to the exclusion of other similar but different nucleic acids, hereinafter 
named Cousin Nucleic Acids (CNAs). It is desirable to be able to detect specific TNAs to the 
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exclusion of any and all CNAs in the detection sample without the necessity of amplification or other 
post-detection processing. 

There art numerous methods which use immobilized or tagged nucleic acids as probes for 
TNAs. However, using known methods, it is difficult to discriminate between a TNA bound to the 
5 Probe Nucleic Acid (PSA) as opposed to a CNA bamd to the PN A. For example, one or more base 

mismatches between the PN A and a CNA can still result in a CNA-PNA hybridization which is 
almost indistinguishable from a TNA-PNA hybridization. Thus, hybridization alone is not an 
optimal in4'i*?tnr that a PNA has hybridized to a unique TNA. 

There are many situations in which a PN A would be used to try to determine whether a TNA 

10 was present in a sample which may contain CNAs. Hybridization of the PNA to any CNA in this 

situation would limit the diagnostic value that the PNA might have for the detection of a TNA, 
absent additional verification. Furthermore, it is desirable to be able to detect and localize TNAs 
with low copy numbers in samples which may contain many copies of CNAs, without the necessity 
of creating additional copies of the TNA. It would also be desirable to be able to confirm the 

15 presence of CNAs, independent of the TNAs, without the necessity of separating the CNAs and 

TNAs in the sample. 

Furthermore, it would be desirable to be able to amplify the signal of even a low frequency 
hybridization of a particular TNA-PNA For this purpose, a method of polymerizing multiple copies 
of a label, hereinafter referred to as a Booster Nucleic Acid (BNA) onto the TNA-PNA would be 
20 desirable. 

The instant invention provides methods and compositions for achieving the foregoing 
desired objectives. As revealed by die following review, the instant compositions and methods have 
not been reported a suggested in the art A general and comprehensive review of the state of art of 
nucleic acid detection is provided in Keller, H., M.M. Manak (1989) DNA Probes, Stockton Press. 

25 A method has been reported for detecting base pair mismatches by chemical means in order 

to determine whether a PNA has hybridized to a CNA rather than to a TNA. In US. Patent No. 
4,794,075 to Ford et ai y a method for distinguishing fragments of DNA which contain single base 
mismatches from their perfectly paired homoiogs is discussed. Single stranded regions within a 
duplex fragment are modified with carbodiimide, which reacts with unpaired guanine (G) and 

30 thymine (T) residues in DNA. Linear duplex DNA molecules do not react, while DNA molecules 

with single base mismatches react quantitatively. Following reaction with carbodiimide, the DNA 
molecules are fractionated on high percentage polyacrylamide gels such that modified and 
unmodified fragments can be distinguished. Ford et ai applied this technique in order to locate and 
purify DNA sequence differences responsible for phenotype variation and inherited disease. 
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Although this method is useful for following variations in genetic material, it has a large number of 
steps, it requires costly components, and it does not offer a direct means of determining whether a 
PNA has hybridized to the TNA exclusive of CNAs in the sample. 

There have been some nttanpts to assure that at least a portion of the hybridization between 
5 the PNA and another nucleic acid is complcmentaiy. One method involves the monitoring of 
transcription products which are produced if the PNA hybridizes to a nucleic acid sufficiently to be 
transcribed from a promoter site contained in the probe. U.S. Patent No. 5,2 15,899 to Dattagupta 
disctoeshowsperiffctuclekac^ through the use of a hairpin probe which, 

upon hybridization with and ligation to a target sequence, is capable of being transcribed. The probe 

10 comprises a angle stranded ac^xmplemmtmy sequence which under tyhridiymg cn»d'rionS, forms 

a hairpin structure having a functional promoter region, and further comprises a single stranded 
probe sequence extending fixm the 3' end of the hairpin sequence. Upon hybridization with a target 
sequence complementary to the probe sequence and ligation of the 3' end of the hybridized target 
sequence to the 5' end of the hairpin probe, the target sequence is rendered transcribable in the 

15 presence of a suitable RNA polymerase and appropriate ribooucleoside triphosphates (rNTPs). 
Amplification is accomplished by hyhridizi^ 

Thta to the PNA, adding the RNA polymerase ari hybrids, and allowing 

transcription to proceed until a desired amount of RNA transcription product has accumulated. That 
method generally and specifically involves the use of hairpin DNA formed with a single stranded 

20 unpaired end to anneal a target sequence. When the target sequence is bound, the production of 
RNA transcription products is enabled. Thus, the method involves the detection of secondary 
transcription products rather than the use of a nucleic acid binding assembly to directly immobilize 
and/or localize a target sequence. A CNA could easily bind to the probe, and the lade of 
complementarity would not necessarily interfere with the formation of a CNA-PNA hybrid which 

25 could then support the production of unwanted transcription products. 

A CNA bound to the PNA might be detected if the lack of complementarity interferes with 
the susceptibility of the hybrid CNA-PNA pair to be cut by a restriction endonuclease. In U.S. 
Patent No. 5,1 18,605 to Unfea and U.S. Patent No. 4,775,6 19 to Urdea, novel methods for assaying 
a nucleic acid analyte were provided, which employ polynucleotides having oligonucleotide 

30 sequences substantially homologous to a sequence of interest in the analyte, where the presence or 
absence of hybridization at a predetermined stringency provides for the release fa label from a 
support. Various techniques are employed for binding a label to a support, whereupon cleavage f 
either a single or double strand, a label may be released from a support, and the release of the label 
can be detected as indicative of the presence of a particular polynucleotide sequence in a sample. 
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However, this technique has the shortcoming that a CNA-PNA pair could be cut by the restriction 
c nrinmc lease, even if there is a mism a tch , so long as the mi s m a tch was outside of theendoouclease 
recognition region. This would lead to failure of the assay to identify a CNA-PNA hybrid 

Another method uses a branched DNA probe to detect nucleic acids. U.S. Patent No. 
5, 1 24,246 to Urdea et aL discloses linear or branched oligonucleotide multimers useful as amplifiers 
in bioc h e mi c al assays which comprise (1) at least one first single-stranded oligonucleotide unit 
(PNA) that is complementary to a single-stranded oligonucleotide sequence of interest (TNA), and 
(2) a multiplicity of second single-stranded, oligonucleotide units that are complementary to a single- 
stranded labeled oligonucleotide. Although amplified sandwich nucleic acid hyhridizations and 
immunoassays using the multimers are described, the method has the limitat i on that PNA-CNA 
hybridization could occur and would result in production of unwanted signal. 

In addition to methods for identification of TNAs, methods have been disclosed for the 
amplification of this DNA. In U.S. Patent No. 5,200,3 14 to Urdea, an analyte polynucleotide strand 
having an analyte sequence (TNA) is detected within a sample containing polynucleotides by 
contacting the analyte polynucleotide with a capture probe (PNA) under hybridizing conditions, 
where the capture probe has a first binding partner specific far the TNA, and a second binding 
sequence specific far a solid phase third binding partner. The resulting duplex is then immobilized 
by specific binding between the binding partners, and non-bound polynucleotides are separated from 
the bound species. The analyte polynucleotide is optionally displaced from the solid phase, then 
amplified by PCR. The PCR primers each have a polynucleotide region capable of hybridizing to 
a region of the analyte polynucleotide, and at least one of the primers further has an additional 
binding partner capable of binding a solid-phase binding partner. The amplified product is then 
separated from the reaction mixture by specific binding between the binding partners, and the 
amplified product is detected. Although it is possible to confirm (by PCR) that a particular nucleic 
acid has hybridized with the PNA, confirmation is expensive and involves multiple steps. 

As for reports that involve the interaction of a double stranded nucleic acid and a DNA- 
binding protein, a method has been described whereby a sequence of immobilized DNA which 
contains binding sites for a single protein is used to purify that protein. U.S. Patent No. 5,122,600 
to Kawaguchi et aL discloses a DNA-immobilized microsphere comprising DNA having base 
sequences which specifically bind a partiailar protein, and a carrier having a particle size of not more 
than 50 pm and not less than 0.01 pm which does not adsorb aiy protein, said carrier and said DNA 
chains being bcxiiKl to each othff by a chemical using said 

microsphere. As this is a purification method for a protein, it does not disclose a method of detection 
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of a TNA nor a method whereby more than one protein is bound to a double stranded nucleic acid 
for the purposes of detection and localization of specific TNA sequences. 

In EP 0453 301, a method for detecting a polynucleotide target sequence in a sample was 
described whoein sequences in a TNA are detected by hybridizing a first and a second PN A to the 
5 TNA. Each of said first and second PNAscoctainod a prc-fbrmod duplex sequence, or a duplex that 
is fanned through chain extension, capable of binding a nucleotide sequence specific billing protein. 
A method for binding a nucleotide specific binding protein to a duplex formed between a TNA and 
a PNA only upon formation of a duplex between the PNA and TNA is neither disclosed nor 
suggested. 

10 In US. Patent No. 4,556,643, a method was disclosed fir the non-radioactive detection of 

specific nucleotide sequences in a sample which involved hybridization of a probe containing DN A 
binding protein specific sequences. However, this disclosure neither taught nor suggested a method 
fir binding a nucleotide specific bending protein to a duplex formed between a TNA and a PNA only 
upon formation of a duplex between sequences present in the PNA and sequeaoes present in the 

15 TNA 



ffrirf S vm^y fl f the In Yffltiftn 
Disclosed arc methods by winch specif 

20 through the use of Probe Nucleic Acids (PNAs) which, upon hybridization with TNAs, are capable 
of binding Target Binding Assemblies (TBAs). Each TBA binds at least one specific region of the 
PNA-TNA hybrid pair, the Target Binding Region (TBR). The TBA is comprised of one or more 
molecules, one or more of which can bind to TBR sequences in a specific and sequence or 
confirmation dependent manner. The TBA may comprise one or more piloting sequences, called 

25 "PILOTS" or "Asymmetry Sequences," which assemble and constrain the nucleotide binding 
compone nt s of the TBA to specific geometries. The PILOTS act to assemble specific nucleic acid 
recognition units or other pilots to which specific nucleic acid recognition units are attached into the 
TBAs in a predetermined fashion. The TBA may also contain one or more molecules which anchor 
or localize the TBA. Novel TBAs having unique discriminating characteristics which surprisingly 

30 render the TBAs useful not only as diagnostic tools but also as prophylactic or therapeutic 
compounds, are also disclosed. Disclosed art methods and compositions for utilization of the PNAs, 
TBRs, TBAs, and TBA PILOTS, including their utilization as components of diagnostic and 
forensic test kits and the utilization of the novel TBAs as prophylactic or therapeutic agents. 
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The PNAs, in addition to TNA-spcrific sequences, may also contain one or more sequences, 
1/2 BBRs, capable of hybridizing with complementary 1/2 BBRs in Booster Nucleic Acids 
(BNAs). Through hybridization of added BNAs to the starter 1/2 BBRs present in the PNAs, 
extensions of the PNAs art made in the form of PNA-BNA and then BNA-BNA hybrids. These 
5 extensions can contain one or more Booster Binding Regions (BBRs). Each BBR is capable of 
binding a Booster Binding Assembly (BBA). The BBA is comprised of molecules, one or more of 
which can bind to a BBR in a specific and sequence or conformation dependent manner. The BBA 
may comprise one or more piloting sequences, called "PILOTS" or "Asymmetry Sequences,*' winch 
assemble and constrain die nucleotide binding components of the TBA to specific geometries. The 
10 PILOTS act to assemble specific nucleic acid recognition units or other pilots to which specific 
nucleic acid recognition units are attached into the BB As in a predetermined fashion. The BBA may 
contain molecules which anchor or localize the BBA or which allow for detection of the bound BBAs 
and thereby of the TBA-TNA-PNA complexes to which they, in turn, are bound. Disclosed are 
methods and eonyoshions fir utilization of the 1/2 BBRs, BNAs, BBRs, BBAs, and BBA PILOTS, 
15 including their utilization as components of diagnostic and forensic test kits. 

Methods and compositions are disclosed for the use of Hairpin Nucleic Acids (HNAs) as 
capping structures. The HNAs contain a self-hybridizing region and a single stranded 1/2 BBR 
which, under hybridizing conditions, can hybridize directly to the 1/2 BBRs in the PNAs or the 1/2 
BBRs in BNAs already bound to the PNAs, to terminate the extension of BNAs onto the PNA or 
20 onto other BNAs. 

Methods and compositions arc disclosed for test procedures and the production of a test kit 
containing PNAs, TBAs, TBRs, BNAs, BBRs, BBAs and HNAs for the detection, localization and 
differentiation of specific nucleic acid sequences, including nucleic acid sequences which are found 
in human cells, in the Human Immunodeficiency Virus (HIV), Human Papillomavirus (HPV), and 
25 in other nucleic acid containing systems including viruses and bacteria 

Accordingly, it is an object of this invention to provide methods and compositions for use 
in binding, detecting, and amplifying the detection of specific Target Nucelic Acid sequences in a 
sample with fidelity and accuracy, even in the presence of closely related but different nucleic acid 
sequences. 

30 Acooniingly, it is an object of this invention to provide methods and compositions for the creation 

of Target Binding Assemblies which specifically bind Target Binding Regions formed by the 
hybridization of Probe Nucleic Acids and Target Nucleic Acid sequences. 

Another object of this invention is to provide a method and compositions for the creation 
of Booster Binding Assemblies which specifically bind Booster Binding Regions fanned by the 
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hybridization of Booster Nucleic Acid sequences with Probe Nucleic Acids, Booster Nucleic Acids 
and Hairpin Nucleic Acids. 

Another object Of this invention is to provide a method and enmpngirirmg cntitpinmg Hairpin 
Nucleic Acids which enable the control of the size of specifically or noo-spccifically elongated 
5 Booster Nucleic Acids and Booster Binding Assemblies used in amplification of PNA-TNA 

hybridization events. 

Another object of this invention is to provide a method and compositions for use in the 
selection, assembly and or chaperoning of specific molecules, each with nucleic acid binding 
discriminating capabilities, into Target and Booster Binding Assemblies. 
10 Another object of this invention is to provide a method and compositions for use in 

amplifying the detection of Target Binding assemblies bound to Target Binding Regions using 
Booster Binding Assemblies and Booster Nucleic Acids. 

Another object of this invention is to provide a method and compositions which allow the 
use of one or more detect ab le labels, including but limited to radioactive labels, light mitring 
15 fluorescent, enzymatic or other signal generating molecules. These labels arc used in association 

with Robe Nucleic Acids, Target Binding Assemblies, Booster Binding Assemblies, Booster Nucleic 
Adds or Hairpin Nucleic Acids. 

Another object of this invention is to provide a method for isolating nucleic acid fragments 
farm an organism which has TBA component binding sites in order to create Probe Nucleic Acids 
20 and TB As which are unique for that fragment or organism. 

3rigf DeflyiptiW* Qf the Pruwings 
The following illustrations are contained in Figure 1: Figure 1-1 is a PNA containing a 1/2 
TBR, which is a single-stranded sequence which is complementary to a TNA and a 1/2 BBR 

25 sequence. Figure 1-IIa is a TNA to which is added the components of Figure 1-1, and, under 
hybridizing conditions, binds the PNA to form the components of Figure 1-IIIa, a PNA-TNA 
hybrid containing at least one TBR. Figure 1-IVa is a BNA which is added to the components of 
Figure Mlla and, under hybridizing conditions, binds the 1/2 BBR of Figure 1-QIa to form a PNA- 
BNA hybrid containing a BBR shown in Figure 1-Va. 

30 Figure 1-IIb is a BNA which is added the components of Figure 1*1, and which, under 

hybridizing conditions, binds the PNA to form the components of Figure 1-IIIb, a PNA-TNA 
hybrid containing a BBR. Figure 1-IVb is a TNA to which is added the components of Figure 1- 
mb and which, under hybridizing conditions, binds the 1/2 TBR of Figure 1-IIIb to form a PNA- 
BNA hybrid containing a TBR shown in Figure 1-Vb. 
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Figure 1-IIc is a HNA which is added to the components of Figure 1-1 and which, under 
hybridizing conditions, binds the PNA to form the components of Figure 1-IIIc, a PNA-HNA 
hybrid containing a BBR Figure 1-IVcb a TNA which is added to the components of Figure 1-IEc 
and which, under hybridizing conditions, binds the 1/2 TBR of Figure 1-IDc to form a PNA-BNA 
hybrid containing a BBR shown in Figure 1-Vc. 

The hybrids which form the TBRs and BBRs air useful in the present invention. The PNAs 
and BNAs, as indicated in Figure 1 , may contain no attached support and/or indicator (OSA), or an 
attached support or other means of localization, including, but not limited to, attachment to beads, 
polymers, and surfaces, and/or indicators. 

Figure 2a is a diagram of strategies for polymerization of BNAs onto PNAs and capping 
by HNAs. 

Figure 2b is a diagram of additional strategies for amplifying PNA-TNA signals via 
polymerization of BNAs and capping by HNAs. 

Figure 3 is a diagram showing the use of BNAs containing multiple 1/2 BBRs per BNA. 

Figure 4a is a diagram showing the binding of TBAs and BBAs to TBRs and BBRs, and 
the ability of the TBA to discriminate between TNAs and CNAs. According to this embodiment, 
if the TBA is immobilized, either on a bead, microliter plate surface, or any other such surface, only 
complexes such as complex X would be retained and detected, while complexes such as complex XI 
would not 

Figure 4b is a diagram exemplifying events similar to those shown in Figure 4a but in a 
slightly different order of occurrence. 

Figure 5 is a diagram exemplifying PNAs containing between one 1/2 TBR and no 1/2 
BBR to PNAs containing up to five 1/2 TBRs and one 1/2 BBR. The (a) and (b) members of each 
numeral (I, II, m, IV, V) farm a set which, upon hybridization to a TNA, provide TBRs cither with 
((a) members) or without ((b) members) an available 1/2 BBR for amplification via hybridization 
to BNAs having complementary 1/2 BBRs. 

Figure 6a is a diagram exemplifying a particular TNA having two 1/2 TBRs which, upon 
binding an appropriate PNA, forms two closely associated TBRs capable of binding two TBAs. A 
1/2 BBR is also provided for amplification. 

Figure 6b is a diagram showing the same events as in Figure 6a except here, a double TBA 
is used so that discrimination between single TBRs that occur in normal cellular samples may be 
discriminated from abnormal, double TBRs. 

Figure 6c is a diagram showing the same scenario as in Figure 6a except that here, five 
TBRs are identified in the TNA. Each TBR msy be bound to a TBA same or different, and each 
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TBA may be differentially labeled, allowing for confirmation that all five sites arc present in the 
TNA. 

Figure 6d is a diagram of the same events as in Figure 6c except here, a double TBA is 
shown, extending what is shown in Figure 6b to the use ofihe double TBA. An example of the TNA 
5 shown in item II in Figures 6a, 6b, 6c and 6d is HIV single stranded DNA or RNA. 

Figure 7 shows the HIV LTR as a TN A, and two PNAs, and a strategy for detection of the 
TNA using the PNAs. 

Figure 8 is a schematic of one embodiment of the invention wherein a target binding 
assembly is used to bind a hybrid TNA-PNA, and booster binding assemblies are used to bind 
1 0 polymerized BNAs. 

Figure 9 is a schematic of a modular TBA in which assembly sequences, linker sequences, 
and asymmetry sequences are used to chaperone desired nucleic acid recognition units together to 
form a TBA. 

Figure 10 shows modular TB As useful in detection of HTV-specific sequences. 
IS Figure 11 shows modular TBAs useful in die detection of human papillomavirus sequences. 

Each unit of E2 is actually a dimer of die DNA binding portion of E2. 

Figure 12a is a schematic of TNA fractionation and shift in mobility due to binding of a 

TBA. 

Figure 12b is a schematic of TNA fractionation and enhanced shift in mobility due to 
20 binding of BBAs in addition to TBAs. 

Figure 13 shows a detection strategy for deletion sequences; an example of use of this 
strategy is for a human papillomavirus integration assay. 

Figure 14 shows assembly of higher order TBAs through use of nucleic acid recognition 
units, linker, assembly, and asymmetry sequences such that various Target Binding Assemblies 
25 specific to binding sites in the HTV LTR art formed. 

Figure 15 shows assembly of higher order TBAs through use of DNA recognition units, 
linker, assembly, and asymmetry sequences such that various Target Binding Assemblies specific 
to binding sites in the HPV genome are formed. 

Figure 1 6 shows the discrimination achieved by using a complex TBA and the ability of 
30 endogenous competitor target binding molecules to eliminate binding of the TBA to a cousin nucleic 
acid but not from the TNA which contains the appropriate orientation of more than one site 
recognized by the TBA. 
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Figure 1 7 shows the ability of a TBA to specifically be targeted to bind to sites of sequence 
mismatch and to preferentially bind those sites over cousin sites which do not contain all of the 
targeted mismatches. 

Brief Description of die Sequences 
SEQ ID NO. 1 corresponds to Figure 5-Ia-l and shows the class I MHC NF-kB binding 

site. 

SEQ ID NO. 2 corresponds to Figure 5 (la) and shows the B2-microglobulin NF-kB 
binding site. 

SEQ ID NO. 3 corresponds to Figure 5 (la) and shows the kappa immunoglobulin NF-kB 
binding site. 

SEQIDN0.4oomspoodfetoFigurc5(Ia)arjds binding sites. 

SEQ ID NO. 5 corresponds to Figure 5 (la) and shows one of the HTV NF-kB binding sites. 

SEQ ID NO. 6 corresponds to Figure 5 (la) and shows the c~myc NF-kB binding site. 

SEQ ID NO. 7 corresponds to Figure 5 (Da) and shows a double HIV NF-kB binding site. 

SEQ ID NO. 8 corresponds to Figure 5 (Ha) and shows a double HIV NF-kB binding site. 

SEQ ID NOS. 9-16 correspond to Figure 5 (Ha) and show a double binding site with one 
site being an HIV NF-kB binding site, and the other site being an HTV SP1 binding site. 

SEQ ID NOS. 17-18 correspond to Figure 5(IIa)and show a double HIV SP1 binding site. 

SEQ ID NOS. 19-31 correspond to Figure 5 (Ilia) and show a double HIV NF-kB binding 
site and an HIV SP1 binding site. 

SEQ ID NOS. 32-33 correspond to Figure 5 (TVa) and show a quadruple binding site 
where two sites are HIV NF-kB binding sites and two sites are HIV SP1 binding sites. 

SEQ ID NO. 34 corresponds to Figure 5 Via) and shows a quintuple binding site where two 
sites are HIV NF-kB binding sites and three sites are HIV SP1 binding sites. 

SEQ ID NO. 35 is an example of a 1/2 BBR, in this case the OL 1 ( OL2 and OL3 elements 
of the bacteriophage lambda left operator, including intervening sequences. 

SEQ ID NO. 36 is an example of a 1/2 BBR, in this case the OR3, OR2 and OR1 elements 
of the bacteriophage lambda right operator, including intervening sequences. 

SEQ ID NO. 37 is the HIV LTR. 

SEQ ID NO. 38 is a PNA complementary to PNA of the HIV LTR. 
SEQ ID NO. 39 is a PNA complementary to a different PNA of the HIV LTR than SEQ 
ID NO. 38. 
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SEQ ID NO. 40 is a PNA ampleraentaiy to partcjftbcHIVLTRaadit aUocantainsa 1/2 
BBR and an overhang sequence for polymerizing BN As onto the PNA. 

SEQ ID NO* 41 is a BNA complementary to the SEQ ID NO. 40 1/2 BBR. 

SEQ ID NO. 42 is a BNA that will polymerize onto the SEQ ID NO. 41 BNA and which, 
with SEQ ID NOS. 40 and 41, creates a Pstl recognition site. 

SEQ ID NO. 43 is a BNA that is complementary to the SEQ ID NO. 42 BNA and which 
completes a BamUl recognition she. 

SEQ ID NO. 44 is an HNA which has a BamHl reaction site that ^hybridize with the 
BamOl recognition site created by SEQ ID NOS. 42 and 43 to the growing polymer. 

SEQ ID NO. 45 is a second PNA which, like SEQ ID NO. 40, is complementary to part of 
the HIV LTR, but not to the same sequence as SEQ ID NO. 40. SEQ ID NO. 45 also encodes a 1/2 
BBR and an overhang which will allow polymerization of BN As starting with a Sphi recognition 
site. 

SEQ ID NOS. 46-62 are human papillomavirus (HPV) specific PNAs which, upon 
hybridization with HPV sequences, form TBRs which bind HPV DN A binding proteins. 

SEQ ID NOS. 63-71 are NF-kB DNA recognition units for incorporation into TBAs. 

SEQ ID NO. 72 is a nuclear localization sequence. 

SEQ ID NO. 73 is a SP 1 sequence recognition unit 

SEQ ID NO. 74 is a TATA binding protein recognition unit. 

SEQ ID NOS. 75-84 are papillomavirus E2 DNA recognition units. 

SEQ ID NOS. 85-92 are asymmetry sequences. 

SEQ ID NO. 93 is an arabidopsis TATA binding protein recognition unit 

SEQ ID NO. 94 is an HPV-16-E2-1 DNA binding protein recognition unit. 

SEQ ID NO. 95 is an HPV-16-E2-2 DNA binding protein recognition unit. 

SEQ ID NO. 96 is an HPV-18-E2 DNA binding protein recognition unit 

SEQ ID NO. 97 is an HPV-33-E2 DNA binding protein recognition unit 

SEQ ID NO. 98 is a bovine papillomavirus E2 DNA binding protein recognition unit. 

SEQ ID NOS. 99-102 are exemplary linker sequences. 

SEQ ID NO. 103 is an exemplary nuclear localization signal sequence (NLS). 

. SEQ ID NOS. 104-108 are exemplary chaperone sequences. 

SEQ ID NOS. 109-116 are exemplary assembled TBA sequences. 

SEQ ID NO. 117 is a consensus NF-kB binding site. 

SEQ ID NO. 118 an HIV Tat amino acid sequence. 
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I I I I 1 1 I M 1 1 I I M single stranded nucleic acid 



I I 1 1 I I II 1 1 1 1 I I I double-stranded nucleic acid 

MMMIIIIMIM 

iflllllllllllITT 

I I I I 1 1 I 1 1 1 1 1 1 1 1 binding region an nucleic acid 



no support or indicators, ox solid support, or other means of localization, 
10 including, but not limited to, attachment to beads, polymers, and surfaces, 

or indicators « OS A 
BBA booster binding assembly 

BBR booster binding region 

BNA booster nucleic acid 

IS CNA cousin nucleic acid 

1/2 BBR single-stranded region which, when hybridized to the complementaiy 

sequence from an HNA or a BNA, can bind a BBA 
1/2 TBR single-stranded region of the PNA which, when hybridized to the 

complementary sequence from a TNA, can bind a TBA 
20 OSA optional support or attachment, circle with box 

PNA probe nucleic acid 

TBA target binding assembly 

TBR target binding region 

TNA target nucleic acid 

25 HNA Hairpin Nucleic Acid 



Definitions 

It should also be understood from the disclosure which follows that when mention is made 
of such terms as target binding assemblies (TBAs), booster binding assemblies (BBAs), DNA 
30 binding proteins, nucleic acid binding proteins or RNA binding proteins, what is intended are 

compositions comprised of molecules which bind to DNA or RNA target nucleic acid sequences 
(TNAs) irrespective of the specificity of the category of binding molecules from which they are 
derived Thus, for example, a TBA adapted to bind to human immunodeficiency virus sequences 
may be most similar to an NF-KB transcriptional factor which typically binds DNA sequences. 
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However, as used herein, i! will be understood that the TBA msy be adapted for optimal use to bind 
to RNA sequences of a particular sequence compositioa or conformation. 

The fidelity of the detection method disclosed herein depends in large measure on the 
selective binding of TBAs and BBAs to particular nucleic acid motifs. It should be understood 
throughout this disclosure that the basis of TBA and BBA discrimination of TNAs from related 
sequences (cmsin nucleic acids or CNAs) may be the formation of precise probe nucleic acid (PNA>» 
target nucleic add (TNA) hybrid segments (PNA-TNA hybrids). However, the basis of 
discrimination may just as well be the formation of a particular conformation, and may not require 
the complete absence of mismatchod-base pairing in the TNA-PNA hybrid. Accordingly, the basis 
of TBA or BBA operation should be understood throughout to depend on discrimination of any 
property unique to the TNA-FNA hybrid as opposed to any properties displayed by any PNA-CNA 
hybrids that may be formed in a test sample contacted with a given PNA. 

Detailed Disclosure of the Invention 
The present invention provides a method for specifically identifying a target nucleic acid 
(TNA) in a sample through the use of target binding assemblies (TBAs) which incorporate specific 
nucleic acid binding proteins. By using probe nucleic acids (PNAs) specific to a given TNA 
sequence, and a TBA which is specific to die duplex target binding region (TBR) formed upon 
formation of hybrid TNA-PNA sequences, a stable TBA-TNA-PNA complex is formed. By 
additionally providing specific amplifiable sequences in the PNA, in addition to sequences which 
specifically contribute to the formation of the TBR recognized by the TBA, the binding of the PNA 
to the TNA is detecte d and the detection amplified. Far this purpose, any of a number of nucleic acid 
amplification systems, including polymerase chain reaction, or the use of branched DNA, each 
branch of which contains a detectable label, may be used. In particular, a novel method of 
a mplifi c ati on is described herein where the amplifiable portion of the PNA contains «cqiraccs onto 
which booster nucleic acids (BNAs) may be polymerized. Upon formation of each BNA-PNA 
hybrid, a booster binding region (BBR) is famed to which a booster binding assembly (BBA) binds 
specifically. If detectably labeled, the BBAs or BNAs provide essentially unlimited amplification 
of the original TNA-PNA binding event 

According to this invention, the TNA will be understood to include specific nucleic acid 
sequences. The TBA will be understood to be any molecular assembly which can specifically and 
tightly bind to a formed TNA-PNA hybrid. The TBA will contain one or more molecules whose 
sequences are sufficient to bind to the TBR. Nucleic acid binding domains which are known can 
either be used directly as components of the TBA or modified according to the teachings provided 
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herein The most readily available molecules with such sequences arc the DNA-binding domains of 
DNA-binding proteins. Specifically, many DNA or RNA binding proteins art known which can 
either be used directly as the known, unmodified protein, or the TBA may be a nucleic acid binding 
protein, modified according to the specific teachings provided herein. In the latter case, specific 
5 modifications that are desirable would include optimization of binding affinities, removal of 

unwanted activities (such as nuclease activity and reorganization of the TBA in die presence of other 
molecules with an affinity for components of the TBA), optimization of selectivity of a target 
sequence over closely related sequences, and optimization of stability. 

Examples of DNA binding proteins which could be used according to this invention are the 

10 DNA-birxfag portions of the transcription factor NF-kB (p50 and p65), NF-IL6, NF-AT, rel, TBP, 

the papilloma vims' £2 protein, sp 1, the repressors cro and CI from bacteriophage lambda, and like 
proteins are well known proteins whose DNA binding portion has been isolated, cloned, sequenced, 
and characterized. In addition, any other DNA-binding protein or portion of a protein dial is 
necessary and sufficient to bind to a TBR hybrid or a BBR is included This includes proteins or 

15 portions of wild-type proteins with altered DNA binding activity as well as protein created with 

altered DNA-binding specificity; such as the exchange of a DNA-binding recognition helix from ooe 
protein to another. In addition, proteins which exhibit nucleic acid binding and other nucleic acid 
functions, such as restriction cndonucleases, could be used as the nucleic acid binding function. 
Proteins which bind to target regions in DNA-RNA hybrids as well as RNA-RNA hybrids are 

20 included. (See, for example, Shi 1995, DeStefano 1993, Zhu 1995, Gonzales 1994, Salazar 1993, 

Jaishrce 1993, Wang 1992, Roberts 1992, Kainz 1992, Salazar 1993(b)). The binding assemblies 
may be constructed with the use of a molecule which chaperones portions of the binding assembly 
so that specific component combinations and geometries can be achieved. This molecule is 
designated here as a PILOT. Pilots can be comprised of proteins or any combination of organic and 

25 inorganic materials which achieve the combinatorial selection and/or to induce specific geometries 

between members of the TBA or BBAs. A chaperone is a stable scaffold upon which a TBA or 
BBA may be constructed such that the correct conformation of the TBA or BBA is provided while 
at the same time eliminating undesirable properties of a naturally occurring nucleic acid binding 
protein. As a specific example of this embodiment, a modified version of the pleiotropic 

30 transcription factor, NF-kB, is provided using a modified bacteriophage lambda cro protein as the 
chaperone. Each NF-kB binding dimer retains the picomolar binding affinity for the NF-kB binding 
site while at the same time the binding assembly presents several advantageous manufacturing, 
stability, and specificity characteristics. 
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In view of the forgoing, the various aspects and embodiments of this invention are 
described below in detail. 

1. The Probe Nucleic Acids fPNAs) and their preparation . The PN As of the present 
invention comprise at least dtoprincpal parts joined together. With reference to Figure 1(1) of the 

5 Hrmvmgg tte first part nfthr PHA krwnr mrww »pignra»g nf h»c*c /Wfli^f^ **!/? TPR " With 

reference to Figure 1(1 and Ha) of the drawings, the 1/2 TBR in the PNA is complementary to a 
sequence of interest in a sample, the TNA containing a 1/2 TBR. With reference to Figure l(IIIa) 
of the drawings, the TNA, when added to the PNA under hybridizing conditions, forms a PNA-TNA 
hybrid containing a TBR. With reference to Figure l(I)of the drawings, the second part of the PNA 

10 is a sequence ofbascs, designated" 1/2 BBR." With reference to Figure 1(1, lib, lie, and IV a) of the 
drawings, the 1/2 BBR in the PNA is complementary to a 1/2 BBR contained in a BNA or a HNA. 
With reference to Figure l(IHb, DBc, and Va) of the drawings, the BNA or HNA, when added to the 
PNA under hybridizing conditions, Conns a PNA-BNA hybrid or PNA-HNA hybrid, respectively, 
containing a BBR. With reference to Figure 1(1) of the drawings, the third part of the PNA is the 

15 OS A, designated by a circle with a box around it The OSA is no support and/or an indicator, or 
solid support, or other means of localization, including but not limited to, attachment to beads, 
polymers, and surfaces and/or indicators which is/are covalently wttached to, or non-covalently, but 
specifically, associated with the PNA. The OSA may be an atom or molecule which aids in the 
separation and/or localization such as a solid support binding group or label which can be detected 

20 by various physical means including, but not limited to, adsorption or imaging of emitted particles 
or light Methods for attaching indicators to oligonucleotides or for immobilizing oligonucleotides 
to solid supports are well known in the art (see Kens' and Manak, supra, herein incorporated by 
reference). 

The PNA of the present invention can be prepared by any suitable method. Such methods, 
25 in general, will include oligonucleotide synthesis and cloning in a replicable vector. Methods for 

nucleic add synthesis are wdl-known in the art When cloned or synthesized, strand purification and 
separation may be necessary to use the product as a pure PNA. Methods of preparing RNA probes 
are well known (see for example Blais 1993, Blais 1994, which uses in vitro transcription from a 
PCR reaction incorporating a 17 RNA polymerase promoter). 
30 The length and specific sequence of the PNA will be understood by those skilled in the art 

to depend on the length and sequence to be detected in a TNA, and the strictures for achieving tight 
and specific binding of the particular TBA to be used (see discussion on TBAs below). In general, 
PNAs of sequence lengths between about 10 and about 300 nucleotides in length are adequate, with 
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lengths f about 15-100 nucleotides being desirable for many of the embodiments specifically 
exemplified herein. 

It should also be understood that the PNA may be constructed so as to contain more than 
one 1 n TBR and to produce more than one TB R for one or more TB As , same or different, as well 
as complex TBRs recognized by novel duplex and multiplex TBAs (see description below regarding 
these novel TBAs) upon hybridization of the PNAs and TNAs. Figure 5 illustrates specific PNAs 
which contain one or more 1/2 TBRs. Specific sequences which correspond to the 1/2 TBR 
sequences illustrated in Figure 5 (la, Da, IDa, IVa, and Va) are SEQ ID NOS. 1 -34 (see Description 
of Sequences above). 

As shown in Figures 2a and 2b, the PNA, containing a 1/2 TBR, may be hybridized with 
one or more BNAs (see description below) and the chain of BNAs polymerized to any desired 
potential length for amplification of the TNA-PNA hybridization event Preferably, between about 
0 and about 10 1/2 BBRs will be present in the PNA. 

As shown in Figures 6a and 6b, the PNA may contain several 1/2 TBRs, same or different, 
which can hybridize with several 1/2 TBRs in a TNA. Each time a 112 TBR in the PNA matches 
a 1/2 TBR in a TNA, a Target Binding Region, TBR, is formed which can bind a TBA. 
Furthermore, it is not essential that all of the TBRs be on a single, contiguous PNA. Thus, in one 
embodiment of the invention, two different PNAs are used to detect sequences on a particular TNA. 
As an illustration of this aspect of the invention, Figure 7 shows one representation of the human 
immunodeficiency virus (HIV) long terminal repeat (LTR). As is known in the art, the HTV LTR 
comprises two NF-kB binding sites and three SP 1 binding sites, in close proximity, wherein NF-kB 
and SP1 arc known DNA binding proteins. Figure 7 provides two PNAs, PNA1 (SEQ ID NO. 38) 
and PNA2 (SEQ ID NO. 39), each of which is complementary to the opposite strand shown as a 
TNA (SEQ ID NO. 37), which shows the two NF-kB binding sites and the three SP1 binding sites 
of the MV LTR According to this aspect of the invention, PNA1 specifically hybridizes with that 
section of the TNA shown in Figure 7 with bases underscored with a symbol, while PNA2 
specifically hybridizes with that section of the TNA shown in Figure 7 with bases underscored with 
an symboL Each ofPNAlorPNA2 may also contain sequences (indicated by the symbols **#" 
or which will hybridize with a BNA's 1/2 BBR sequences (see below). In addition, each of 
PNA1 and PNA2 may be differentially tagged with an OSA, such as a fluorophore such as a 
fluorescein or a rhodamine label, which would allow confirmation that both probes have become 
bound to the TNA. If only one label or neither label is detected, it is concluded that the TNA is not 
present in the sample being tested. 
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In a further aspect f the embodiment shown in Figure 7, a method for altering the 
specificity of the instant assay method is shown. By changing the length f the gap between PNA1 
and PNA2, such that the region of TNA remaining unhyhridized is altered, one practicing this 
invention is able to alter the discrimination of the assay. 
5 In order to more clearly exemplify this aspect of the invention, it is necessary to emphasize 

that the TBR may have a helical structure. Thus, while PNAI creates TBRs oo one "face" of the 
helix, PNA2 creates a TBR on either the same or a different face of the helix, depending on the 
distance between the middle of each TBR (unrttrimeri in Figure 7). If the middle of each binding site 
is an integral product of 10 J bases apart, the TBRs will be on the same side of the helix, while non- 
10 integer products of 10.5 bases apart would place the TBRs on opposite sides of the helix. In this 
fashion, any coopcrativity in binding by the TBA recognizing the PNAI TBR and the TBA 
recognizing the PNA2 TBR can be manipulated (see Hochschild, A., M. Ptashne [1986] Cell 
44:681-687, showing this effect for die binding of bacteriophage lambda repressor to two different 
operator sites located at different distances from each other in a DNA helix). As described by 

15 Peddnse/al ([1993] £MBO J. 12:355 1-3558), oooperativity between NF-kB and the SP1 sites is 
required to achieve activation of the HIV LTR. However, for the purpose of the instant invention, 
the double NF-kB-triplc SP1 binding site motif in die HIV LTR may be taken advantage of by 
providing a single, novel binding protein capable of binding both sites simultaneously, but only if 
the spacing between the sites is geometrically feasible. This is controlled both by the structure of 

20 the selocted TBA arid by the PNAs used. Thus, in the embodiment exemplified in Figure 7, the two 
probes may be used with a large enough interprobe region of single-stranded DNA remaining such 
that, even if the NF-kB and SP1 binding sites are on opposite sides of the helix, the single-stranded 
region between the probes provides a sufficiently flexible "hinge" so that the DNA can both bend 
and twist to accommodate the geometry of the TBA . Alternatively, a more stringent assay may be 

25 designed by narrowing the interprobe distance such that the DNA may only bend, but not twist 

Finally, the probes may be so ck>sely spaced, or a single PNA used, such that the DNA can only bend 
but not twist Thus, this figure exemplifies and enables the production of detection systems with any 
given desired degree of discrimination between target nucleic acids having similar sequences, but 
different juxtapositions of these sequences. 

30 In terms of a diagnostic a forensic kit for HIV, those skiUod in the art would understand that 

the aforementioned aspects of this invention allow for the tailoring of the components of the 
diagnostic or forensic kit to match what is known at any given time about the prevalent strains of 
HIV or another pathogen or disease condition. It will also be appreciated by those skilled in the art 
that, while detection of HIV infection is not the only utility of the instant invention, due to the 
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mutability of the HIV genome, it is probably one of the most complex test environments for such a 
diagnostic. It is precisely in such a mutable environment, however, where the flexibility of the 
instant method, coupled with its ability to discriminate between very closely related sequences, may 
be most dearly appreciated In less mutable environments, some of the sophistication to which this 
5 invention is amenable need not be utilized. Thus, in a diagnostic kit for papillomavirus infection, 

all of the discrimination characteristics of die TBA-TBR interaction are available, along with the 
ability to amplify the signal using the BNAs and BBAs, but a single, simple PNA, such as any one 
of SEQ ID NOS. 46-62, may be used which identifies unique papillomavirus sequences, which also 
are known to bind to a TBA such as the papillomavirus E2 protein or truncated DNA binding 
10 portions thereof (see Hegde et al [1992] Nature 339:505-512; Monini et al [1991] J. Virol 

65:2124-2130). 

In applying the instant method to the detection of a particular TNA for the purposes of 
assessing whether certain nucleic acids arc present which are associated with the progression f 
melanoma, hepatoma, breast, cervical, lung, colon, prostate, pancreatic or ovarian cancers, the TNA 

15 may be obtained from biopsy materials taken from organs and fluids suspected of containing the 

cancerous cells. For the detection of genetic deficiencies, the TNA may be obtained from patient 
samples containing the affected cells. For detection of fermentation contaminants and products in 
the manufacture of food, chemical or biotechnology products or in the bioremediaticn of wastes, the 
TNA may be obtained from samples taken at various stages in the fermentation or treatment process. 

20 For detection of food or drug pathogens or contaminants , the TN A sample may be obtained from the 

food or drug, swabs of food or surfaces in contact with the food, fluids in contact with the food, 
processing materials, fluids and the like associated with the manufacture of or in contact with the 
food, drug, or biological samples taken from those in contact with the food or drug or the like. 

2 The Booster Nucleic Acids fBNAsV Booster Binding Regions (BBRsl and their 

25 preparation. The BNAs of the present invention are comprised of at least one or more 1/2 BBRs 

coupled to an OSA. The 1/2 BBRs can hybridize to complementary 1/2 BBRs contained in the 
PNA, other BNAs or an HNA. 

With reference to Figure 1 (I, lib and Mb) of the drawings, the simplest BNA is comprised 
of two parts. With reference to Figure l(IIb) of the drawings, the first part of the simplest BNA is 

30 a sequence of bases which is complementary to the sequence in the PNA which is designated 44 1/2 

BBK" With reference to Figure l(IIb) of the drawings, the second part of the simplest BNA is the 
OSA, (tesignatcd by a circle with a box around it The OSA is no support and/or indicator, or solid 
support, or other means of localization, including but not limited to, attachment to beads, polymers, 
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and surfaces and/or indicators which are covalcntly attached to, or non-covalcntly, but specifically, 
associated with the BNA. 

With reference to Figure 2a(II and ID) of the drawings, the BNA may contain more than one 
1/2 BBR sequence. The BNA illustrated in Figure 3(11) contains a mpirntr which is 
5 complementary to the PNA illustrated in Figure 3(1) and two other 1/2 BBR sequences. The BNA 

illustrated in Figure 3(HI) contains two 1/2 BBR sequences which are complementary to two of the 
1/2 BBR sequences in the BNA illustrated in Figure 3(11), plus up to *V additional 1/2 BBRs for 
polymerization of additional BNAs. 

Undabybhdizb^onditi the BNA illustrated in Figure 3(H), when combined with the 
10 PNA illustrated in Figure 3(1), creates the PNA-BNA hybrid illustrated in Figure 3(TVa) containing 

a BBR and an unhybridized extension with two additional 1/2 BBR sequences or "booster" 
sequences. The BBRs created by said hybridization can be identical, similar or dissimilar in 
sequence. The BBRs created by said hybridization can bind identical, similar or dissimilar BBAs 
(see below). The BNAs may have prepared analogously to the PNAs. 
15 Under hybridizing conditions, the BNA-BNA hybrid illustrated in Figure 3(TVb), when 

combined with die PNA illustrated in Figure 3(Vb), creates die PNA-BNA hybrid illustrated in 
Figure 3 (VI) containing a BBR, two additional BNA-BNA hybrids containing BBRs, and an 
unhybridized extension with an additional 1/2 BBR sequence, a "booster" sequence. The BBRs 
created by said hybridization can be identical, similar or dissimilar in sequence. The BBRs created 
20 by said hybridization can bind identical, similar or dissimilar BBAs (see below). The BNAs may 

be prepared in a fashion analogous to preparation of the PNAs. 

3. The Tarpct Nucleic Acids (TNAs) and their preparation . The first step in detecting and 
amplifying signals produced through detection of a particular TNA according to the present method 
is the hybridization of such target with the PNA in a suitable mixture. Such hybridization is 
25 achieved under suitable conditions well known in the art 

The sample suspected or known to contain the intended TNA may be obtained from a 
variety of sources. It can be a biological sample, a food or agricultural sample, an environmental 
sample and so forth. In applying the instant method to the detection of a particular TNA for the 
purposes of medical diagnostics or forensics, the TNA may be obtained from a biopsy sample, a 
30 body fluid or exudate such as urine, blood, milk, cerebrospinal fluid, sputum, saliva, stool, lung 
aspirates, throat or genital swabs and the like In addition, detection may be in situ (see for example 
Embretson 1993; Patterson 1993; Adams 1994). 
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15 



20 



25 



30 



35 



40 



45 



Accordingly, PNAs specific to vertebrates (including mammals and including humans) or 

to any or all of the following microorganisms of interest may be envisioned and used according to 

the instant method: 

Cqrynebftffmfl 

Corynebocterium diphtheria 

Bacillus thuringiensis 

Diplococcus pneumoniae 

Streptococcus pyogenes 

Streptococcus salivahus 
Staphylococcus 

Staphylococcus aureus 

Staphylococcus albus 
Pseudomonas 

Pscudomonas stutzen 

NwSffift 

Neisseria meningitidis 

Neisseria gonorrhea 
folti^pfrafferiaceae 

Escherichia coli 

Aerobacteria aerogcnes 

Klebsiella pneumoniae 

Salmonella typhosa 

Salmonella choleraesuis 

Salmonella typhimurium 

Shigellae dysenteriae 

Shigellae schmtai 

Shigellae arabinotarda 

Shigellae flexneri 

Shigellae boydit 

Shigellae sonnei 
Other enteric bacilli 

Proteus vulgaris 

Proteus mirabilis 

Proteus morgani 

Pseudomonas aeruginosa 

Alcaligenes faecalis 

Vibrio cholerae 
Hcmophilus-Bordctella group 

Hemophilus influenza. H. ducryi 

Hemophilus hemophilus 

Hemophilus aegypticus 

Hemophilus parainfluenzae 

Bordetella pertussis 
Pasteurellae 

Pasteurella pestis 

Pasteurella tulareusis 



The colifonn bacteria 
The Salmonellae 



The Shigellae 



Proteus species 



50 
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Brucella melitensis 

Brucella abortus 

Brucella suis 
5 Aerobic Snore-Forming Bacilli 

Bacillus anthracis 

Bacillus subtilis 

Bacillus megatertum 

Bacillus cereus 
10 Anrergfrc Spcro-FprminR BftaHj 

Clostridium botulinum 

Clostridium teiani 

Clostridium perfringens 

Clostridium novyi 
IS Clostridium septicum 

Clostridium histolytiaim 

Clostridium tertium 

Clostridium bifermentans 

Clostridium sporogenes 

20 Mvrebftttcrifl 

Mycobacterium tuberculosis hominis 

Mycobacterium bovis 

Mycobacterium avium 

Mycobacterium leprae 
25 Mycobacterium paratuberculosis 

AcPwmwflcff (fimgus-like bacteria) 

Actinomyces isaeli 

Actinomyces bovis 

Actinomyces naeslundii 
30 Nocardia asteroides 

Nocardia brasiliensis 
The Spirpchttcg 

Treponema pallidum 

Treponema pertenue 
35 Treponema carateum 

Spirillum minus 

Streptobacillus moniliformis 

Borrelia recurrens 

Leptospira icterohemorrhagiae 
40 Leptospira canicola 

TryppjiasCTPps 

Mygoptasmps 

Mycoplasma pneumoniae 
Other ppftOgCM 
45 Listeria monocytogenes 

Erysipelothrix rhusiopathiae 

Streptobacillus moniliformis 

Donvania granulomatis 

Bartonella bacillformis 

50 
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Rickettsiae (bacteria-like parasites) 
Rickettsia prowazeiai 
Rickettsia mooseri 
Rickettsia rickettsii 
5 Rickettsia conori 

Rickettsia australis 
Rickettsia sibiricus 
Rickettsia akari 
Rickettsia tsutsugamushi 
10 Rickettsia burnetti 

Rickettsia quintana 
Chlamydia (unclassifiable parasites bacterial/viral) 
Chlamydia agents (naming uncertain) 

E&mgL 

Cryptococcus neoformans 
Blastomyces dermatidis 
Histoplasma capsulatum 
Coccidioides immitis 
Paracoccidioides brasiliensis 
Candida albicans 
Aspergillus fumtgatus 
Mucor corymbifera {Abstdia corymbifera) 
Rhizopus oryzae 

Rhizopus arrhizus Phycomycetes 
Rhizopus nigricans 
Sporotrichum schenhi 
Flonsecaea pedrosoi 
Fonsecaea compact 
Fonsecacae dermatidis 
Cladosporium carrioni 
Phialophora verrucosa 
Aspergillus nidulans 
Madurella mycetomi 
Madurella grisea 
Allescheria boydii 
Phialophora jeanselmei 
Microsporum gypsum 
Trichophyton memagrophytes 
Keratinomyces ajelloi 
Microsporum cams 
Trichophyton rubrum 
Microsporum adouint 

Viruses 

Adenoviruses 
Herpes Viruses 

Herpes simplex 
Varicella (Chicken pox) 
Herpes zoster (Shingles) 
Virus B 

Cytomegalovirus 
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Poxviruses 



5 



Variola (smallpox) 
Vaccinia 
Poxvirus bovis 
Paravaccinia 
Molluscum contagiosum 



picomgviruscs 



10 



Poliovinis 
Coxsackievirus 
Echoviruscs 
Rhinoviruses 



Mvxovinises 



15 



Influenza (A, B, and C) 
Parainfluenza (1-4) 
Mumps virus 



Newcastle disease virus 
Measles virus 
Rinderpest virus 
Canine distemper virus 
Respiratory syncytial virus 
Rubella virus 

Eastern equine encephalitis virus 
Western equine encephalitis virus 
Sindbis virus 
Chikugunya virus 
Semliki forest virus 
Mayora virus 

St. Louis encephalitis virus 
California encephalitis virus 
Colorado tick fever virus 
Yellow fever virus 
Dengue virus 

Reovirus types 1*3 



Human immunodeficiency viruses (HIV) 
Human T-cdl lymphotrophic virus I & II (HTLV) 

Hcpptifo 

Hepatitis A virus 

Hepatitis B virus 

Hepatitis nonA-nonB virus 

Hepatitis, C.D.E 
Tumor viruses 

Rauscher leukemia virus 

Gross virus 

Maloney leukemia virus 
Human papilloma viruses 
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It would be understood by one of skill in the art that it is generally required to treat samples 
suspected of containing a particular TNA in such a fashion as to produce fragments that can easily 
hybridize with the PNA. It may be necessary to treat the test sample to effect release of or to extraa 
the TNA for Irybridizarjon, such as by exposing blood or other cells to a hypotonic environment, or 
5 otterwisc disrupting the san^leusi^ means. When the TNA is thought to be present 

in double stranded form, it would naturally be desirable to separate the strands to render the TNA 
bybridizable in single stranded form by methods well known in the art, including but not limited to 
heating or limited exposure to alkaline conditions which may be neutralized upon addition of the 
single stranded PNA to allow hybridization to occur. Methods for preparing RNA targets are well 
10 known (see Watcrhousc 1993, Mitchell 1992). 

Fragmentation of nucleic acid samples containing TNAs is usually required to decrease the 
sample viscosity and to increase the accessibility of the TNAs to the PNAs. Such fragmentation is 
accomplished by random or specific means known in the art Thus, for example, specific nucleases 
known to cut with a particular frequency in the particular genome being analyzed, may be used to 
15 produce fragments of a known average molecular size. In addition, other nucleases, 
phosphodiesterases, cxonucleascs and endonudeases, physical shear and sonication are all methods 
amenable for this purpose. These processes are wdl known in the art The use of restriction 
enzymes for the purpose of DNA fragmentation is generally preferred. However, DNA can also be 
fragmented by a variety of chemical means such as the use of the following types of reagents: 
20 EDTA-Fe(II) (according to Stroebd et al [1988] X Am. Chem. Soc. 1 10:7927; Dervan [1986] 

Science 232:464); Cu(II>phenanthro1irte (according to Chen mid Sigman [1987] Science 237: 1 197); 
class IIS restriction enzyme (according to Kim et al [1988] Science 240:504); hybrid DNAse 
(according to Corey et al [1989] Biochem. 28:8277); bleomycin (according to Umezawa et al. 
[1986] J. Antibiot. (Tokyo) Ser. A, 19:200); neocarzinostatin (Goldberg et al [1981] Second 
25 Annual Bristol-Myers Symposium in Cancer Research, Academic Press, New York, p. 1 63); and 
mdhidiunipropy]-EDTA-Fe(II) (according to Hertzberg et al [1982] J. Am Chem. Soc 104:313). 
Removal of proteins, as by treatment with a protease, is also generally desirable and methods for 
effecting protein removal from nucleic acid samples, without appreciable loss of nucleic acid, are 
well known in the art 

30 The TNAs of the present invention should be long enough so that there is a sufficient 

amount of double-stranded hybrid flanking the TBR so that a TBA can bind unperturbed by the 
unligated fragment ends. Typically, fragments in the range of about 10 nucleotides to about 100,000 
nucleotides, and preferably in the range of about 20 nucleotides to about 1 ,000 nucleotides are used 
as die average size for TNA fragments. Examples of specific TNA sequences that could be detected 
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art sequences complementary to the PNA sequences described herein for detection of normal 
cellular, abnormal odtolnr (as in activated oncogenes, integrated foreign genes, genetically defective 
genes), end pathogen-specific nucleic acid sequences, for which specific nucleic acid binding proteins 
are known, or which can be produced according to methods described in this disclosure. With 
5 reference to Figure 7, a specific HTV-related TNA is shown as S EQ ID NO. 3 7. 

4. Extensions to the PNA using BNAi their preparation, nod signal amplification. Under 
hybridizing conditions, BNAs can be added that hybridize to the PNAs, PNA-BNA hybrids, BNAs 
and/or BNA-BNA hybrids. The afijremcntionod additions can be made in a non*voctorial polymeric 
fashion or in a vectorial fashion, with a known order of BNAs. 
10 With reference to Figure 2a, a simple booster is presented A booster polymer is produced 

by adding two BNAs, illustrated in Figure 2a(Ib and Ic), which when combined under hybridizing 
conditions with the PNA, form PNA-BNA-BNA hybrids, comprised of the PNA and "booster" 
extensions", illustrated in Figure InfU&JIbflc and Hd) leaving at least one unpaired 1/2 BBR 
sequence. Each impaired 1/2 BBR sequence, illustrated in Figure 2a(IIa, lib, lie, lid) can hybridize 
IS with additional BNAs to form additional "booster* extensions. Each my aired 1/2 BBR sequence, 
illustrated in Figure 2a(HaJIbJIc and Dd) can hybridize with added HNAs, illustrated in Figure 
2a(IQaandinb). The hybridization of the HNAs, which cannot hybridize additional BNAs, acts to 
u cap" the addition of the BNAs onto the PNA, as illustrated in Figure 2a(IVa, IVb, IVc and IVd). 
With reference to Figure 2b, it is possible to control and specify the order and components 
20 of extensions to the PNA. If a single BBR is required, a HNA containing the complementary 
sequence to the 1/2 BBR in the PNA is added to the PNA to produce a single BBR and to "cap" any 
"booster" extensions to the PNA. If additional BBRs are to be added to the PNA, a controlled 
extension of the PNA can be accomplished. 

With reference to Figure 2b t a simple booster is presented. Vectorial polymer extension is 
23 accomplished by adding a BNA which is specific for the PNA, as illustrated in Figure 2b(Ia and Ha), 

which when combined under hybridizing conditions with the PNA, form PNA-BNA-BNA hybrids, 
comprised of the PNA and "booster'' extensions. These extensions, if labeled with an OSA, provide 
a method for greatly amplifying any signal produoed upon binding of a PNA to a TNA in the sample. 
Furthermore, by binding labeled BBAs to the BBRs in the polymer, additional amplification is 
30 achieved. 

Any of a number of methods may be used to prepare the BNAs, including, e.g., synthesis 
via known chemistry or via recombinant DNA production methods. In the latter method, an 
essentially unlimited number of BNAs may be produced simply and inexpensively, for example, by 
production in prokaryotes (£ coli for example) of a plasmid DNA having multiple repeats of the 
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specific BNA sequences flanked by restriction sites having overhanging ends. In this fashion, for 
example, the bacteriophage lambda left or right operator sites, or any other DNA or other nucleic 
acid sequence known to specifically and tightly bind a particular BBA, such as a DNA or RNA 
binding protein, may be produced in an essentially unlimited number of copies, with each copy 
5 flanked by an EcoRl, PsA, BamHl or any of a number of other common restriction nuclease sites. 

Alternatively, a polymer at repeated sites may be excised by unique restriction sites not present 
within the polymer. Large quantities ofpBR322, pUC plasmid or other plasmid having multiple 
copies of these sequences are produced by methods well known in the art the plasmid cut with the 
restriction enzyme flanking the polymerized site, and the liberated multiple copies of the operators 
10 isolated either by dranalognspby or any others The BNA, prior 

to use, is then strand separated and is then amenable for polymerization onto a PNA encoding a 
single stranded complementary copy of the operator as a 1/2 BBR. The BN As may be polymerized 
voctorially onto die PNA by using different restriction enzymes to flank each repeat of the polymer 
in the plasmid used to produce multiple copies of the BNA. Alternatively, the BNA polymer may 
15 be hybridized to the PNA via overhangs at one or both ends of the BNA polymer, without the need 

to strand separate and anneal each BNA segment Examples of specific BNA sequences are 
provided above in the section entitled Description of Sequences, as SEQ ID NOS. 35-36. To 
stabilize the BNA polymer, DNA ligasemay be used to covalently link the hybridized BNAs. 

5. The Hairpin Nucleic Acids (HNAs) and their preparation. The HNAs of the present 
20 invention comprise at least two principal parts joined together: A single-stranded sequence, which 

is complementary to a 1/2 BBR, and a double-stranded nucleic acid region formed, under hybridizing 
conditions, by the self-association of self-complementary sequences within the HNA. With reference 
to Figure l(IIc) of the drawings, the 1/2 BBR in the HNA may be constructed so as to be 
complementary to die 1/2 BBR sequence in the PNA. With reference to Figure 1(1, He and Ilk) of 
the drawings, the aforementioned HNA, when added to the PNA under hybridizing conditions, forms 
a PNA-HNA hybrid containing a BBR. With reference to Figure l(IIlc, IVc and Vc) of the 
drawings, a PNA-HNA hybrid, under hybridizing conditions, upon addition of the TN A, can form 
a TN A-PNA-HNA hybrid containing a TBR and a BBR. 

With reference to Figure 2a and 2b, the HNAs can be used to "captor terminate the addition 
of BNA extensions to the PNA. The two BNAs in Figure 2a(Ib and Ic) can associate to form the 
hybrid shown in Figure 3(IVb) or can hybridize directly and individually to the PNA as illustrated 
in Figure 2a(Ia-c, Ila-d). The two HNAs (shown in Figure 2a(IIIa and Mb)) can terminate the 
hybridization of the BNA to other BNAs which extend from the PNA, as illustrated in Figure 2a 
(IVa-d). The HNA in Figure 2a(IDa) can terminate the PNA-BNA hybrids shown in Figure 2a(Ub 
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and Ed) and any PNA-BNA hybrid with 8 single stranded 1/2 BBR which is complementary to the 
1/2 BBR in the HNA illustrated in Figure 2a(IIIa). Similarly, the HNA in Figure 2a(IHb) can 
terminate the PNA-BNA hybrids shown in Figure 2a(IIa and Dc) and any PN A -BNA hybrid with two 
single stranded 1/2 BBRs which are complementary to the 1/2 BBRs in the HNA illustrated in 
Figure 2a(HIb). 

HNAs art constructed that will terminate PNA-BNA hybrids which are constructed from 
the sequential addition of BNAs to the PNA as illustrated in Figure (2b). The single stranded 1/2 
BBR sequences illustrated in Figure 2b(Ia, IOa, Va, and Vila) are specifically complementary to the 
single stranded 1/2 BBR sequences illustrated in Figure 2b(Ib r IIIb % Vb arxi Vllb) and produce the 
unique capped PNA-BNA-HNA hybrids illustrated in Figure 2b(IcJIIcVc and VDc). 

The sctf<ocnplaTicntary sequences in the HNA and the loop sequence which links the self- 
complementary hairpin sequences can be of any composition and length, as long as they do not 
substantially impede or inhibit the presentation of the single-stranded 1/2 BBR that comprises part 
of the HNA by the HNA or selectively bind the BBA or the TBA. The loop sequences should be 
selected so that formation of the loop does not impede formation of An examples of an 

HNA useful in tins application is provided as SEQ ID NO. 44 (see Description of Sequences above). 

$ t The Target Binding Asymfrhcs (TPA$) fftfj ti^r reparation. A TBA may be any 
substance which binds a particular TBR formed by hybridization of particular TNAs and PNAs, 
provided that the TBA must have at least the following attributes : 

(a) The TBA must bind the TBR(s) in a iashion that is highly spocific to the TBR(s) 
ofinterest Tfart is,theTBAmustdiscrir^ 

PNA hybrid and similar duplex sequences formed by PNA-CNA hybrids. The 
TBA must bind the PNA-CNA hybrid with a sufficiently low avidity that upon 
washing the TBA-TNA-PNA complex, the PNA-CNA hybrid is displaced and the 
PNA-TNA hybrid is not displaced; 

(b) The TBA must avidly bind the TBR(s) orated by the hybridization of the TN A 
with the PNA. Binding affinities in the range of 10" 5 to about 10 " or higher are 
generally considered sufficient. As noted below, in some instances, it might be 
desirable to utilize a particular TBA which has a very low avidity for a particular 
TBR, but which has a greatly increased affinity when a particular configuration of 
multiple TBRs is provided so that the square of the affinity of the TBA for each 
TBR becomes the affinity of relevance to that particular TBA. 

Examples of the DNA binding components useful in the formation of TBAs include, but are 
not limited to NF-kB, papillomavirus £2 protein, transcription factor SP1, inactive restriction 
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enzymes, antibodies, etc. Each of these proteins has been recognized in the art to contain sequences 
which bind to particular nucleic acid sequences and the affinities of these interactions are known. 
Naturally, die method of the instant invention is not limited to die use of these known DNA binding 
proteins or fragments thereof Fran the instant disclosure, it would be apparent to one of ordinary 
5 skill that the instant method could easily be applied to the use of novel TBAs exhibiting at least the 
required attributes noted above. Thus, for example, in WO 92/20698, a sequence specific DNA 
binding molecule comprising an oligonucleotide oonjugate formed by the covalent attachment of a 
DNA binding drug to a triplex forming oligonucleotide was described The method of that 
disclosure could be used to produce novd 

10 that the TBAs thus formed meet the criteria described above. In addition, the methods of U.S. Patent 
Nos. 5,096,815, 5,198,346, and WO88/06601, herein incorporated by reference, may be used to 
generate novel TBAs for use according to the method of this invention. Specific antibodies or 
portions thereof could be used (see for example Blais 1994). 

Where the TBA is a protein, or a complex of proteins, it will be recognised that any fa 

15 number of methods routine in the art may be used to produce the TBA. The TBA may be isolated 
from its naturally occurring environm^ is impractical, produoed by the standard 

techniques of molecular biology. Thus, using NF-kB as an example, using the DNA binding 
portions of p50 or p65 submits, this binding assembly could be produced according to recombinant 
methods known in the art (see fir example Ghosh [1990] Cell 62: 1019-1029, describing the cloning 

20 of the p50 DNA binding subunit of NF-kB and the homology of that protein to rel and dorsal). 

Many DNA and other nucleic acid binding proteins are known which can be used as or in 
TBAs according to this invention. Once the amino acid sequence of any DNA, RNA.DNA, RNA 
or other nucleic acid binding protein is known, an appropriate DNA sequence encoding the protein 
can either be prepared by synthetic means, or a cDNA copy of the mRNA encoding the protein from 

25 an appropriate tissue source can be used Furthermore, genomic copies encoding the protein may 

be obtained and introns spliced out according to methods known in the art Furthermore, the TBAs 
may be chemically synthesized. 

Once an appropriate coding sequence has been obtained, site-directed mutagenesis may be 
used to alter the amino acid sequence encoded to produce mutant nucleic acid binding proteins 

30 exhibiting more desirable binding characteristics than those of the original nucleic acid binding 
protein. As an example of this process, the amino acid sequence of the DNA binding portions of 
NF-kB can be altered so as to produce an NF-kB' molecule which more tightly binds the NF-kB 
binding site (see examples below - HTV-Detect and HIV-Lock). 
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To provide further insight into this aspect of the invention, the following considerations arc 
to be noted Using NF-kB as an example, a TB A may be prep ared using the naturally occurring NF - 
kB molecule. Howeva, because this molecule is present in vanishingly small quantities in cells, and 
because the subunits of this DNA binding protein have been cloned, it would be more reasonable to 
prepare large quantities of the complex via recombinant DNA means as has already been 
accomplished for this protein (see for example Ghosh [1990] Ce// 62:1019-1029). NF-kB is a 
pkkrtropic induoer of genes involved in immune, inflammatoty and growth regulatory responses to 
primary pathogenic (viral, bacterial or stress) challenges or secondary pathogenic (inflammatory 
cytokine) challenges. NF-kB is a dxmeric DNA binding protein comprising a p50 and a p65 subunit, 
both of which contact and bind to specific DNA sequences. In an inactivated state, NF-kB resides 
in the cellular cytoplasm, complexcd yrith a specific inhibitor, I-kB, to farm a cytoplasmic 
heterotrimer. Upon activation, the inhibitor is decomplexed, and the p50-p65 dimer relocates via 
a specific nuclear localization signal (NLS) to die cell's nucleus where it can bind DNA and effect 
its role as a Uauscriptional activator of numerous genes (see Grimm and Baeuerie [1993] Biochem 
J. 290:297-308, far a review of die state of the art regarding NF-kB). 

The p50-p65 dimer binds with picomolar affinity to sequences matching the consensus 
GGGAMTNYCC (SEQ ID NO. 117), with slightly different affinities depending on the exact 
sequence. It is worth noting that homodimers of p50 and p6S have also been observed to occur. 
These homodimers display different biochemical properties as well as slightly different affinities of 
binding sequences within and similar to the above consensus. Thus, depending on the desired 
binding characteristics of the TBA, a p50-p65 heterodimer, a p5O-p50 homodimer, or a p65-p65 
homodimer or fragments of the aforementioned dimcrs may be used. 

One way in which various novel TBAs may be produced far use according to this inventioo 
is shown schematically in Figure 9. The nucleic acid recognition units of the TBA may be assembled 
and associated with similar or dissimilar TBA nucleic acid recognition units via a "chaperonc." The 
chaperane is a structure on winch the various TBA recognition elements are built and which confers 
desirable properties on the nucleic acid recognition units. The chaperane is comprised of any 
sequence which provides assembly sequences such that same or different nucleic acid recognition 
units are brought into close and stable association with each other. Thus, for example, in the case 
of a TBA designed to tightly bind NF-kB TBRs, a TBA is assembled by providing lambda cro 
sequences as assembly sequences, linked to the nucleic acid binding sequences for either NF-kB p5 0 
orp65. The p50 or p65 nucleic acid binding sequences are linked to the cro sequences at either the 
carboxy or amino terminus of cro and either the carboxy or amino terminus of the nucleic acid 
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recognition unit of the pSO or p65. Linking sequences arc optionally provided to allow appropriate 
spacing of the nucleic acid recognition units for optimal TBR binding. 

The assembly sequences, exemplified above by cro and CI sequences (SEQ ID NOS. 104- 
1 08), oomprise any stable oligopeptides which naturally and strongly bond to like sequences. Thus, 
5 in the case of cro t it is well known that a dimcr of cro binds to the bacteriophage lambda operator 
sites (Anderson et al. [1981] Nature 290:754-758; Harrison and Aggarwal [1990] Ann. Rev. 
Biochem. 59:933-969). The mrnner units of cro tightly and specifically associate with each other. 
Thus, by linking DNA recognition unit sequences to the cro sequences, close and tight association 
is achieved. 

10 The optional linker sequences comprise any amino acid sequence which does not interfere 

with TB A assembly or nucleic acid binding, and which is not labile so as to liberate the nucleic acid 
recognition unit from the complete TB A. It is desirable but not necessary that the Linker sequences 
be covalently linked to other binding assembly components. The association should be specific so 
as to aid in the assembly and manufacture of the binding assemblies. Examples of such sequences 

15 include, but are not limited to, such well known sequences as are found linking various in 

structural proteins. Thus, for example, in the lambda repressor protein, there is a linking sequence 
between the DNA binding domain and the dimeri ration domain which is useful for this purpose. 
Many other such sequences are known and the precise sequence thereof is not critical to this 
invention, provided that routine experimentation is conrhtrtnri to ensure stability and non-interference 

20 with target nucleic acid binding. Examples of such sequences are provided herein as Met Ser and 

SEQ IN NOS. 99-102. Insertion of specific, known proteolysis sites into these linkers is als an 
integral part of this invention. The presence of such sites in the linker sequences would provide 
manufacturing advantages, allowing different molecules to be assembled on the chapcrone scaffold 
In addition to the nucleic acid recognition units, optional linking sequences, and assembly 

25 sequences, the novel TBAs of this invention optionally have asymmetry or PILOT TNA sequences 

and one or mac OSA units. The asymmetry sequences arc provided to encourage or prevent certain 
desirable or undesirable associations. For example, in the event that a TBA having homodixneric 
p50 DNA recognition units is desired, the asymmetry sequences are provided to disrupt the naturally 
stronger association ofNF-kB p50 subunits and p65 subunits, while not disrupting the assembly 

30 sequences from bringing together p50 subunits. Examples of such sequences are provided herein 
as SEQ ID NOS. 85-92 and SEQ ID NOS. 105 and 106. 

In a different configuration, NF-kB p50 subunit sequences are brought into close association 
with transcription factor SP1 DNA recognition unit sequences. This is desirable in the event that 
an NF-kB/SPl binding motif is of significance, asintte HIV LTR where a motif of at least six DNA 
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binding protein recognition sites, two NF-kB, three SP1, and a TATA site are known to exist Since 
it is also known that the second NF-kB and first SP1 site air significant to regulation of HIV 
transcription (Perkins «f ai [1993]£m6oi 12:3551-3558), this particular configuration of TBA 
is useful not only in the detection of HIV, but as a therapeutic or prophylactic against HIV infection 
5 (see below). In a similar fashion, the long control region (LCR) of human papillomavirus may be 

used as a key control region for probing accenting to this method 

In view of the different elements that can be associated, cassette fashion, according to this 
method of TBA formation, an essentially unlimited variety of TBAs are produced In Figure 10, 
a series of different molecules, referred to as "HTV-detcct I-IV" are exemplified wherein "CHAP" 
10 denotes the chaperone, "nfkb" denotes NF-kB subunits, "spl" denotes die nucleic acid recognition 
unit of the SP1 transcription factor, and TATA* 9 denotes a dimcr of the DNA recognition unit of 
a TATA sequence DNA binding protein (TBP), also known as a TATA binding protein, or TBP. 
These configurations are fintfaa exemplified below and are all integral parts of the instant invention. 
In yet another configuration, die modular structure diown in Figure 9 is adapted to detection 
15 aixi or treatment or prophylaxis o In Figure 11, in a similar fashion 

to the above described M HIV-detcct I-IV" molecules, a series of "HPV-Dctect I-IV' molecules is 
produced In this embodiment, advantage is taken of the DNA binding properties of the E2 protein 
of human papillomavirus (HPV). In addition, the roles of SP1 and TBP are taken advantage of by 
providing specific DNA recognition units adapted to bind to these sequences in the HPV genome. 
20 In the formation of die E2-sperific TBAs for use in detecting HPV infection, it may be desirable to 

use any of SEQ ID NOS. 75-84 or 93-98 as the E2 DNA recognition units. A TBA containing a 
bovine E2 dimer and a human E2 dimer DNA binding domain may be particularly useful. 

The various sequences described above may either be chemically linked using pure 
oligopeptide starting materials, or they may be linked through provision of recombinant nucleic acids 
25 encoding, via the well known genetic code, the various subelements. In the event of recombinant 

production, linking era coding sequences to sequences of nucleic acid recognition units to form 
TBAs is advantageous because not only does era act as assembly sequences in the chaperone, it also 
acts to direct die proper folding of the nucleic acid recognition elements. Exemplary sequences for 
chaperones are provided herein as SEQ ID NOS. 104*108. Furthermore, in the event that higher 
30 order structures comprising multiple binding sites is desired, as in a pentameric NF-kB/NF- 

kB/SPl/SPl/SPl TBA, proper design of the asymmetry sequences allows such structures to be 
made. 

In die foregoing fashion, TBAs are prepared which bind to their cognate binding sites with 
high affinity. For example, the NF-kB DNA binding components of the TBAs of Figure 10 are 
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expected to bindtotheHIV-LTRwithanaffmty^ 10* and 10* 1 2 molar. Sequences 

useful as the DNA recognition units are provided as SEQ ID NOS. 63-71, 73-84, 93-98, and 104- 
108 and exemplified further below. 

In view of the foregoing description of directed assembly of nucleic acid binding proteins 
S using assembly and asymmetry (or piloting) sequences, those skilled in the art will recognize that 

a generally applicable method for assembling protein structures is provided by this invention. The 
generality of this method is demonstrated further by consideration, by way of further example, of the 
use of an antibody-epitope interaction in die assembly of desired structures. By way of specificity, 
a DNA binding protein structure may be assembled by linking an NF-kB pSO subunit to an antigen, 
10 such as a circularized (through disulfide bonds) melanocyte stimulating hormone (MSH). This pro- 
MSH molecule may then be bound by an and-MSH antibody to provide a novel nucleic acid binding 
assembly, with the antigen and antibody acting as assembly sequences. 

The modular structure provided by Figure 9 reveals that a great variety of TBAs may be 
assembled using different combinations of components. Thus, representative embodiments of this 
15 general structure are provided as SEQ ID NOS. 109-1 16. 

7, T>* Bftretgr Assemblies fBBAs) and their preparation A BBA may be any 

substance which binds a particular BBR formed by hybridization of particular PNAs and BNAs, 
including when multiple BNAs (up to and including V BNAs, i.e., BNA* , wherein "n" is 
theoretically 0-°°, but practically is between about 0 and 100) are polymerized onto the PNA for 
20 signal amplification, provided that the BBA must have at least the following attributes: 

(a) The BBA must bind the BBRs in a fashion that is highly specific to the BBR of 
interest That is, the BBA must discriminate between BBRs present in the PNA- 
BNA hybrid and similar duplex sequences in BNA-CNA hybrids or other CNAs. 
Thus, where even a single base mismatch or conformational differences with or 

25 without base mismatches occur in the production of the PNA-BNA, or PNA* 

BNA D -HNA hybrid, the BBA must bind the hybrid with a sufficiently low avidity 
that upon washing the TBA-TNA-PNA-BNA n complex, die BBA is displaced from 
the CNA sequences but not the BBR sequences. 

(b) The BBA must avidly bind the BBR(s). Binding affinities in the range of 1 0 5 to 
30 about 10* or higher are generally considered sufficient 

Examples of BBAs include, but are not limited to cro 9 and the bacteriophage lambda 
repressor protein, CI In addition, see U.S. Patent No. 4,556,643, herein incorporated by reference, 
which suggests other DNA sequences and specific binding proteins such as repressors, histones, 
DNA modifying enzymes, and catabolitc gene activator protein. See also EP 0 453 301, herein 
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incorporated by reference, which suggests a multitude of nucleotide sequence specific binding 
proteins (NSSBPs) such as the tetracycline repressor, the lac repressor, and the tryptophan repressor. 
Each of these BBAs has been recognized in the art to bind to particular, known nucleic acid 
sequences and the affinities of these interactions are known. Naturally, the method of the instant 
5 invention is not limited to die use of these known BBAs. From the instant disclosure, ooe of 
ordinary skill could easily apply the use of novel BBAs exhibiting at least the required attributes 
noted above to the instant method. 

Examples of novel BBAs useful according to this aspect of the invention include novel 
proteins based on the motif of a known DNA orRNA or DNA.RNA binding protein such as era or 

10 the k CI reprtSSOr protein Preferably, giieh mnrfifaiitinng at* mitffc fn iirtpmvr th* Hmvtiing nf thw 

components of the invention. Thus, it may be desirable to add a high concentration of era to an 
assay. One of the negative qualities of era is that at high concentrations, die binding of era to its 
DNA target comes into competition with era-era interactions. Thus, for example, a chaperoned or 
mutated era may be produced which does not have this shortcoming. Examples of such altered 

15 duperooes are SEQ IDNOS. 105-106 and 108. Methods known in the art, such as production of 
novel target binding proteins using variegated populations of nucleic adds and selection of 
bacteriophage binding to particular, pre-sdected targets (i.e., so-called phage-display technology, 
see discussion above for production of novd TBAs) may be used to produce such novel BBAs as 
well as the aforementioned novd TBAs . 

20 Where the BBA is a protein, or a complex of proteins, it will be recognized that any of a 

number of methods routine in the art may be used to produce the BBA. The BBA may be isolated 
from its naturally occurring awinMmeminnature, or if this is im 

techniques of molecular biology. Thus, for example, the sequence of the era protein is known and 
any molecular clone of bacteriophage lambda may be used to obtain appropriate nucleic adds 
25 encoding era for recombinant production thereof In addition, the TBAs described herein may be 
used as BBAs, provided that different TBAs are used to bind TBRs and BBRs 

fi Thcuy^f Bftftft and Tfflfo ^ \nt^ pnplpfy fa fo? M »*fa" fr e PNA-TNA- 

TBA complexes (see Figure 8). In one embodiment of this invention, the highly specific and 
extremely tight binding of TBAs comprised of nucleic acid binding components is used to produce 
30 an amplifiable nucldc acid sandwich assay. According to one aspect of this embodiment, a solid 
support is coated with a first TBA creating an immobilized TBA. In solution, a PNA and TNA are 
contacted under hybridizing conditions and then contacted with the immobilized TBA. Only those 
PNA-TNA interactions which form the specific TBR recognized by the immobilized TBA are 
retained upon wash-out of the solid surface which binds the TBA-TBR complex. 
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Detection of the bound TBR is accomplished through binding of Booster Nucleic Acids, 
BNAs, to die 1/2 BBRs present on the PN As under hybridizing conditions. In this maimer, even if 
only a single TBA-TBR complex is bound to the immobilized TBA, a large, amplified signal may 
be prochxad by polymerizing multiple BNAs to the immobilized TNA. Each BNA which binds to 
5 die TNA forms a BBR which can be bound by BBAs which, like the TB As immobilized on the solid 

surface, may be chosen for their vay tight and specific binding to particular nucleic acid structures* 
Thus, according to this embodiment, the immobilized TBA may contain the DNA binding portion 
of NF-kB, which very specifically and tightly binds to NF-kB binding sites formed upon 
hybridization of the TNA and PNA to form such a site. 

10 Because it is well known that there are NF-kB binding sites both in the normal human 

genome and in the long terminal repeats of human immunodeficiency virus (HIV), this invention 
provides a method of discriminating between the "normal" human sites and the sites present in cells 
due to HIV infection. Therefore, in a test designed to determine the presence or absence of HIV 
DNA in a sample of human DNA, the HIV NF-kB binding sites may be viewed as the TNA, and the 

IS normal human NF-kB binding sites may be viewed as CNAs. According to the method of this 

invention, discrimination between these TNAs and CNAs is accomplished by taking advantage of 
the fact dial in Ac HIV LTR, there are two NF-kB binding sites, followed by three SPI sites (see, for 
example, Koken et al [ 1992] Virology 191 :968-972), while cellular NF-kB binding sites with the 
same sequences are not found in tandem. 

20 In cases where the TNA contains more than one 1/2 TBR and it is desirable to pursue the 

therapeutic and prophylactic applications of the TBAs, it may be desirable to use more than one 
TBA, each with the capacity to bind a TBR in the TNA-PNA complex. In this case, it may be 
advantageous to select, as components of the TBAs, DNA-binding or RNA-binding domains with 
lesser affinity for its TBR than the wild-type DNA-binding or RNA-binding domain. Given that the 

25 TBAs which are involved in the binding to the multiple TBRs can either assemble together before 

binding to their TBRs or assemble together after binding to their TBRs, the individual TBAs will 
not block the corresponding TBRs in the other genomes than the target genome unless the TBRs art 
spatially capable of binding the assembled TBA complex. One feature of the multimeric assembly 
of TBAs which is specifically claimed here as part of this invention is that such a multimeric 

30 assembly is expected to have a much reduced affinity for a single site within the TNA. However, 

since the binding is dramatically increased relative to any one TBA, the TBA complex would be 
expected to not compete for the binding of any single TBR with the corresponding native proteins 
in situ but bind tightly to sequences in the PNA-TNA hybrid containing the TBRs for each of the 
nucleic acid-binding components assembled in the TBA. The TBA complex should be assembled 
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and linkers adjusted in the individual TBAs so as to allow the nucleic acid-binding regions contained 
in the TBA complex to simultaneously reach and bind to these targets. 

Once the TNA-PNA hybrids have formed and been contacted with the immobilized TBA, 
unbound wdeic acid is washed from the immobilized surfm and the ^ hybrids detected. 

5 This is accomplished in any one of several ways. In one aspect of this invention, the PNA is labeled 

with an OSA, such as a radionuclide, colored beads, or an enzyme capable of forming a colored 
reaction product Furthermore, in addition to having one or more 1/2 TBRs, the PNA also may 
contain at least one 1/2 BBR. The 1/2 BBR sequences are chosen so as to be complementary to 
unique 1/2 BBR sequences in BNAs. In the embodiment described above, for example, where the 
1 0 TBA is NF-kB and the TBR formed upon TNA-PNA hybridization is one or more NF-kB binding 
sites, the 1/2 BBRs may provide hybridizable (that is, single-stranded, complementary) sequences 
of the left or right bacteriophage lambda operators (see, for example, Ptashnc [1982] Scientific 
American 247:128-140, and references cited therein for sequences of these operators). These may 
be polymerized onto the PNA 1/2 BBRs in a vectorial fashion (see Figures 2 and 3) providing up 

15 to "n" BBRs, and each BBR forms a cro binding site. Enzymatically, radioactively, or otherwise 
labeled cro, is contacted with the TBA-TNA-PNA-CBNAX, complex. In this fashion, a highly 
selective and amplified signal is produced Signal produced using a PNA having a single 1/2 TBR 
indicates success of the asssy in achieving TBA-TBR binding and polymerization of the BNAs to 
produce signal from cellular sites (i.c. from CNAs). Absence of signal when a dimcriyed TBA is 

20 used indicates that in the TNA, there were no HIV LTRs as no double NF-kB binding sites were 
present On the other hand, presence of signal using the dimer NF-kB indicates HIV infection. As 
a specific example of the foregoing description of this embodiment of the invention, see Example 
6 describing an HTV test kit 

Naturally, those skilled in the art will recognize that the foregoing description is subject to 

25 several modifications in the choice of PNAs, TNAs, TBAs, BNAs, and BBAs. Furthermore, in 
systems other than HIV, those skilled in the art will recognize that the general method described 
above could be likewise applied. However, these other applications may be simpler than the above 
described method as the TBAs iced may not recognize any dotm] cellular sites and therefore resort 
to dimerization or other methods of discriminating between TNAs and CNAs may be less critical. 

30 In riffrigning probes and binding assemblies for these other systems, the skilled artisan will be guided 
by the following principles and considerations. 

In the above-described embodiment, the appeal of using the DNA-binding portions of NF- 
kB protein as the TBA and the NF-kB recognition binding elements as the TBRs is that these 
dements form an important ''control point" for the replication of HIV. That is, it is known that HIV 
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is required touseNF-kB as a critical feature in its replicative life cycle. Similar control points for 
other pathogens are chosen and used as a basis for detection according to the methods described 
herein. 

From the foregoing description of general features of this invention and the mode of its 
5 operation, one skilled in the art will recognize that there are a multiplicity of specific modes for 
practicing this invention. By way of example, the method of this invention is adaptable to a method 
and devices using chranatographk test kits described in U.S. Patent Nos. 4,690,691 and 5,310,650 
(the '69 1 and "650 patents). In those patents, a porous medium was used to immobilize either a TNA 
or a capture probe, and a solvent was used to transport a mobile phase containing either a labeled 
10 PNA, if the TNA was immobilized, or the TNA, if a capture probe was immobilized, into the 

"capture zone." Onoe the TNA was bound in the capture zone, either by directly immobilizing it or 
through capture, a labeled PNA was chncmatographed through the capture zone and any bound label 
was detected 

Adapting the instant invention to such a system provides the improvement of using a Target 

15 Binding Assembly in the capture zone and therefore, the capture of only perfectly matched TBR 

sequences or other TBRs representing nucleic acid confirmations specifically bound by the TBA 
within the TNA-PN A duplexes by virtue of the previously described sensitive discrimination by the 
TBA between TNAs and CNAs. 

Once the TNA-PNA hybrids become bound to the immobilized TBA, the signal is amplified 

20 by adding BNAs or chromatographing BNAs through the capture zone. Finally, the signal may be 

further amplified by adding BBAs or chromatographing labeled BBAs through the capture zone. 
In this fashion, the ease of performing the analysis steps described in the '691 and '650 patents is 
improved upon herein by providing the additional ability to increase the specificity and, through 
amplification, the sensitivity of the method described in those patents. The disclosure of the '691 

25 and '650 patents is herein incorporated by reference for the purpose of showing the details of that 

method and for the teachings provided therein of specific operating conditions to which the 
compositions and methods of the instant invention are adaptable. 

Those skilled in the art will also recognize that the method of the instant invention is 
amenable to being run in microliter plates or to automation. The use of machines incorporating the 

30 method of this invention therefore naturally falls within the scope of the instant disclosure and the 
claims appended hereto. Thus, for example, this invention is adaptable for use in such instruments 
as Abbott Laboratories* (Abbott Park, IL) lMx tabletop analyzer. The IMx is currently designed to 
run both fluorescent polarization immunoassay (FPZA, see Kier [1983] KCLA 3:13-15) and 
microparticle enzyme immunoassay (MEZA, see Laboratory Medicine, Vol. 20, No. 1, January 
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1989, pp. 47-49). The MEZA method is easily transformed into a nucleic acid detection method 
ising the instant invention by using a TBA as a capture molecule coated onto a submicron (<0.5 jim 
on average) sized microparticlc suspended in solution. The micxoparticles coated with TBA arc 
pipetted into a reaction cell. The IMx then pipettes sample (hybridized PNA-TNA) into the reaction 
5 cell, forming a complex with the TBA. After an appropriate incubation period, the solution is 
transited to an inert glass fiber matrix for which die particles have a strong affinity and to which 
the microparticles adhere. Either prior to or after filtering the reaction mixture through the glass 
fiber matrix, BNAs and BBAs are addbd, or mother signal amplification and detection means is used 
which depends on specific formation of TNA-PNA hybrids. The immobilized complex is washed 

10 and the unbound material flows through the glass fiber matrix. 

The bound complexes are detected by means of alkaline phosphatase labeled BBAs or 
otherwise (radioactively, enzymatically, fluorescently) labeled BBAs. In the case of alkaline 
phosphatase labeled BBAs, the fluorescent substrate 4-methyl umbdliferyl phosphate or like reagent 
may be added Alternatively, the enzyme may be bypassed by directly labeling BBAs with this or 

15 a like reagent In any event, fluorescence or other signal is proportional to the amount of PNA-TNA 
hybrids present 

The fluorescence is detected on the surface of the matrix by means of a front surface 
fluoromrtcr as dcyribH by mmwifar tinrr nf the TMx With minor adjustments that can be made 
through routine experimentation to optimise an instrument such as the IMx for nucleic acid 

20 hybridization and nucleic acid-TBA interactions, the instant invention is completely adaptable to 
automated analyses ofTNA samples. 

9. Other diagnostic applications of this invention. While the foregoing description enables 
the use of the instant invention in a number of different modes, many additional utilities of this 
invention are readily appreciated, for example, in a mobility retardation system. 

25 In this embodiment of the invention, an improvement of the well known clcctrophorctic 

mobility shift assay (EMSA) is conducted as follows (See Figures 12a and 12b): 

A sample of DNA is fragmented, either through random cleavage or through specific 
restriction aidoouckase treadMnL The DNA in the sample is then split into two equal aliquots and 
a specific TNA is added to the first aliquot but not to the second The first and second aliquot are 

30 then electrophoresed in an acrylamide or agarose gel, and the pattern of DNA bands (either 
visualized through ethidium bromide binding or through being radioactively labeled prior to 
electrophoresis is then compared for the two aliquots. Fragments of DNA having binding sites to 
which the TBA is specific are retarded in their migration through the electrophoretic medium. By 



WO 96/17956 



PCT/US95/15944 



38 

using an appropriate TBA, any number of DNA or other nucleic acid sequences may be tracked in 
this fashion. 

In a modification oftbcEMSA described above, fragmented TNA is hybridized with a PNA 
and fractionated in a first dimension. The fractionated DNA is then reacted with an appropriate TBA 
5 and the change in mobility of the DNA fragments is noted. Enhancement of the retardation is 

possible by adding BBAs as described above. (See, for example, Vijg and references cited therein 
for known techniques of two (2) dimensional nulceic acid electrophoresis, to which the instant 
method may be applied). 

10. Therapeutic applications . Because of the very tight and selective nucleic acid binding 
] 0 characteristics of the novel TBAs described herein, therapeutic utilities are contemplated in addition 
to the diagnostic utilities of these compounds. Thus, a TBA comprising tight and specific binding 
for the HIV-LTR, by virtue of having an NF-kB p50 and an SP1 DNA recognition unit in close 
association (see Figure 10, HIV-Detect II) is useful to bind up the HIV-LTR and thereby prevent 
transcription from this key dement of the HIV genome. The unique features of the assembly 
IS sequences of the TBA allow recombinant vectors to introduce DNA encoding such a TBA into a cell 

and die proper folding of the expressed sequences. Once inside the cell, the nuclear localization 
signals of the p50 subunit directs the transport of the TBA to the nucleus where it binds tightly to 
die LTR of any integrated HIV, effectively shutting the pathogen down. In a prophylactic mode, one 
that is concerned about potential HIV exposure is administered a sufficient dose of a TBA or a 
20 recombinant vector able to express the TBA, so as to lock up any HIV that might have entered the 
person In this mode, the use of the TBA is analogous to passive protection with a specific immune 
globulin. In the therapeutic or prophylactic mode, NLS sequences art used in place of the OSAs 
used in tte diagnostic mode. Exemplary NLS sequences are provided as SEQ ID NOS. 72 and 103 
(sec also Heinzinger 1994 and Bukinsky 1993, describing NLS sequences of the HIV Vpr and gag 
proteins respectively). In any event, the TBA is administered in a phannaceutically-acccptable 
earner, known in the art such as a sterile salt solution or associated with a liposome or in the form 
of a recombinant vector, preferably one which directs expression of the TBA in a chosen cell type, 
or by a protein delivery system. 
H. Embodiments of the Invention 

In view of the foregoing description and the examples which follow, those skilled in the art 
will appreciate that this disclosure describes and enables various embodiments of this invention, 
including: 
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1. A probe nucleic acid (PNA) comprising: 

(a) a single-stranded sequence, 1/2 TBR, which is capable of forming, under 
hybridizing conditions, a hybrid, TBR, with a 1/2 TBR present in a target nucleic 
acid(TNA); 

5 (b) 2en>, one or more, and preferably one to ten single stranded sequences, 1/2 BBR, 

which is capable of forming, under hybridizing conditions, a hybrid BBR, with a 
1/2 BBR present in a booster nucleic acid (BNA); and 
(c) an OS A, which is no attached support and/or indicator, or an attached support or 
other means of localization, including, but not limited to, attachment to beads, 
10 polymers, and surfaces, and/or indicators; 

wherein said TBR is capable of binding with high affinity to a TBA, said TBA being a substance 
capable of discriminating between a paired TBR and a TBR having unpaired nucleotides, and 
further, wherein said BBR is capable of binding with high affinity to a BBA, said BBA being a 
substance capable of discriminating between a paired BBR and a BBR having unpaired nucleotides. 
IS This embodiment includes TBRs which are nucleic acid binding protein recognition sites, such as 
the HIV LTR, and other nucleic acid binding proton recognition sites in other pathogens, some of 
which are noted above. The PNA of this embodiment of the invention may produce a TBR which 
is a nucleic arid bindii^ protemi^^ she present in the genome of a pathogen or is a binding 
site associated with a pathogenic condition in the human genome or a contaminant in a fermentation 
20 process. 

2. A booster nucleic acid (BNA) comprising: 

(a) a 1/2 BBR which has a sequence which is complementary to a 1/2 BBR sequence 
in a PNA or another BNA already hybridized to the PNA and which is capable of 
forming, undef hybridizing conditions, a hybrid, BBR, with the PNA; 
25 (b) an OSA attached support or other means of localisation, including, but not limited 

to, attachment to beads, polymers, and surfaces, and/or indicators; and 
(c) additional hybridization sites, 1/2 BBRs, for hybridization with additional BNAs; 
wherein said BBR is capable of binding with high affinity to a BBA, said BBA being a substance 
capable of discriminating between a paired BBR and a BBR having unpaired nucleotides. 
30 3. A Hairpin Nucleic Acid (HNA) comprising a single-stranded sequence, 1/2 BBR, which 

under hybridizing conditions is capable of forming a hairpin while at the same time binding to a 
BNA to form a BBR capable of binding a BBA, wherein said BBR is capable of binding with high 
affinity to a BBA, said BBA being a substance capable of discriminating between a paired BBR and 
a BBR having unpaired nucleotides. 
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4. A method for detecting a specific TNA sequence, comprising the steps of: 

(a) hybridizing said TNA with a PNA as described above; 

(b) hybridizing said PNA with a BNA containing a 1/2 BBR whose sequence is 
complementary to a 1/2 BBR sequence in the PNA; 

(c) adding the products of steps (a) axid(b)eontaiiunga TBRandaBBR,u>asurfiK^ 
liquid or other medium containing a TBA; 

(d) adding BBAs to the mixture in step (c) wherein said BBA comprises: 

(i) a molecule or a portion of a molecule which is capable of selectively 
binding to a BBR; and 

(ii) a detectible indicator, 

and 

(e) detecting signal produced by the indicator attachod to the BBA. This method 
includes the use of a protein indicator, including enzymes capable of catalyzing 
reactions leading to production of colored reaction products. It also includes 
indicators such as a radionuclide or colored beads. 

5. A method for detecting the presence in a sample of a specific Target Nucleic Acid, TNA, 
which comprises: 

(a) contacting said sample with a Probe Nucleic Acid, PNA, which, upon hybridization 
with said TNA if present in said sample, forms a Target Binding Region, TBR, 
which is capable of binding a Target Binding Assembly, TBA; 

(b) contacting said sample, already in contact with said PNA, with a TBA capable f 
binding to my TBRs finned by the l^dizatknofsaidPNAandsaidTNA inthe 
sample. 

6. A method for detecting or localizing specific nucleic acid sequences with a high degree 
of sensitivity and specificity which comprises: 

(a) adding PNAs containing a 1/2 BBR and a 1/2 TBR to a sample containing or 
suspected of containing TN As containing 1/2 TBR sequences, to form a complex 
having target binding regions, TBRs, formed by the hybridization of 
complementary 1/2 TBRs present in the PNAs and TN As respectively; 

(b) binding the TBRs formed in step (a) to an immobilized TBA to form a TBA-TNA- 
PNA complex; 

(c) adding Booster Nucleic Acids, BNAs, containing booster binding regions, 1(2 
BBRs, to the complex formed in step (b) such that the 1/2 BBRs in the BNAs 
hybridize with the 1/2 BBR sequences present in the PNAs or to 1/2 BBRs present 
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in BNAs already bound to the PNA, to form BBRs, such that TBA-TNA-PNA- 
(BNA) n complexes are formed; 

(d) adding Hairpin Nucleic Acids, HNAs, containing 1/2 BBR sequences, to the 
conpkx fonned in step (c) such thai the 1/2 BBRs in the HNAs hybridize with any 

5 available 1/2 BBR sequences present in the BNAs of the complex of step (c), 

thereby capping the extension of the BNAs onto the TBA-TNA-PNA-flBNA^ 
complexes of step (c) to form TBA-TNA-PNA-(BNA) 0 -HNA complexes; 

(e) adding Booster Binding Assemblies, BBAs, linked to indicator moieties, to the 
TBA-TNA-PNA-{BNA) n -HNA complexes formed in step (d) to form TBA-TNA- 

10 PNA-(BNA-BBA)„-HNA complexes; and 

(£) detecting the signals pioduoed by the indicator moieties linked to the TB As, PN As, 
BNAs, BBAs or HNAs in the TBA-TNA-PNA^BNA-BBA^-HNA complexes of 
step(e); 

wherein: 

15 the TNA comprises: 

(i) one or more specific 1/2 TBR nucleic acid sequences, the presence or 

absence of which in a particular sample is to be confirmed; 
the PNA comprises: 

(i) a single-stranded sequence, 1/2 TBR, which is capable of forming, under 
20 hybridizing ecHxfcti^ with a 1/2 TBRpresent in a target 

nucleic acid (TNA); 

(ii) a single stranded sequence, 1/2 BBR, which is capable of forming, under 
hybridizing conditions, a hybrid BBR with a 1/2 BBR present in a booster 
nucleic acid (BNA); and 

25 (iii) an OSA, which is no attached support and/or indicator, or an attached 

support or other means of localization, including, but not limited to, 
attachment to beads, polymers, and surfaces, and/or indicators; 
the BNA comprises: 

(i) a 1/2 BBR, as shown in Figure l(IIb), which has a sequence which is 
30 complementary to a 1/2 BBR sequence in a PNA and which is capable f 

forming, under hybridizing conditions, a hybrid, BBR, with the PNA; 

(ii) an OSA attached support or other means of localization, including, but not 
limited to, ffttnrhmnrt to beads, polymers, and surfaces, and/or indicators; 

(iii) additional hybridization sites, 1/2 BBRs, for other BNAs; and 
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(iv) sequences, 1/2 BBRs, which can hybridize to BNAs already hybridized to 

thcPNA; 
the BBA comprises: 

(i) a molecule or a portion of a molecule which is capable of selectively 
5 binding to a BBR; and 

(ii) no attached support and/or indicator, or an attached support or other 
means of localization, including, but not limited to, attachment to beads, 
polymers, and surfaces, and/or indicators; 

and the TBA comprises: 

10 (i) a molecule or a portion of a molecule which is capable of selectively 

binding to a TBR; and 
(ii) no attached support and/or indicator, or an attached support or other 
means of localization, including, but not limited to, attachment to beads, 
polymers, and surfaces, and/or indicators. 
15 7. An improvement to a solid phase hybridization method for detecting the presence of a 

target polynucleotide involving: immobilizing a target polynucleotide, if present in a test sample, 
directly or via an intennediate capture structure, on a solid phase at a capture site; before, during or 
after said immobilization, attaching a detectable label to said target polynucleotide, if present; and 
detecting said label, if any, at said capture site; the improvement comprising: 
20 (a) using a Target Binding Assembly, TBA, as the means for achieving immobilization 

of said target polynucleotide, wherein said TBA binds only to a perfect hybrid 
formed between a specific Probe Nucleic Acid, PN A, and said target nucleic acid 
such that a perfect Target Binding Region, TBR, recognizable by said TBA is 
formed; and 

25 (b) including in the PNA a single stranded sequence, 1/2 BBR, capable of binding a 

Booster Nucleic Acid, BNA, containing a single stranded complementary 1/2 BBR 
which, upon hybridization with the 1/2 BBR in the PNA, forms a BBR capable of 
binding labeled Booster Binding Assemblies, BBAs. 
8. A target binding assembly, TBA, comprising one or more nucleic acid recognition units, 
30 linker sequence(s), assembly sequenced), asymmetry sequenced), nuclear localization signal 
sequenced) (NLS) and OSA(s). The nucleic acid recognition unit may be an NF-kB binding unit, 
an SP1 binding unit, a TATA binding unit, a human papillomavirus binding unit, an HIV LTR 
binding unit, or a binding unit for any other fragment of specific sequence the detection of which is 
desirable and which can be achieved through specific association with the TBA. Such recognition 
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units include, but are not limited to those exemplified herein as SEQ ID NO. 63, SEQ ID NO. 64, 
SEQIDNO. 65,SEQIDNO. 66, SEQ ID NO. 67, SEQ ID NO. 68, SEQ ID NO. 69, SEQ ID NO. 
70, SEQ ID NO. 71, SEQIDNO. 72, and SEQ ID NO. 73. Linker sequences such as oligopeptides 
winch do not interim with the nucleic acid recognition function of the nucleic acid recognition unit 
5 aal which provide stability and control over the spacing of the nucleic acid recognition unit from the 

icmaindo-oftheTBA. Examples of such linker sequenoes art wen known in them and include, but 
are not limited to oligopeptide sequences from the interdomain primary sequence of a structural 
protein. Assembly sequences include oligopeptide sequences which direct the folding and 
association of nucleic acid recognition units. A preferred example of such sequences are 

10 oligopeptides derived fran die bacteriophage lambda cro protein. The asymmetry sequence directs 

the ft<?yyi*t"™ of nucleic acid recognition and assembly sequences in a predetermined order. Such 
asymmetry sequences are exemplified by sequences derived from insulin, relaxin, gonadotropic 
hoimone, FSH, HCG, LH, ACTH, including but not limited to SEQ ID NOS. 85-92. With reference 
to Figures 14 and IS, SEQ ID NO. 85 is an "A" and SEQ ID NO. 86 is a **B M sequence; SEQ ID 

15 NO. 87 is an "A" and SEQ ID NO. 88 is a W B" sequence' SEQ ID NO. 89 is a human relaxin tt A" 
and SEQ ID NO. 90 is a human relaxin "B" sequence; SEQ ID NO. 9 1 is a skate relaxin "A" and 
SEQ ID NO. 92 is a skate relaxin "B M sequence. In addition, the TBA may contain nuclear 
localization signal sequences, NLS, which direct the migration and uptake of a protein or complex 
associated with said NLS into the nucleus of a cell. Examples of such NLS sequences are provided 

20 as SEQ ID NOS. 72 and 103. Preferred embodiments of the TBA include but are not limited to HIV 

Detect I-IV or HPV Detect I-IV, and SEQ ID NOS. 109-1 16. 

9. Methods of using the novel TBAs of this invention include, but are not limited to a 
method of using the TBA to bind a particular nucleic acid sequence in a target nucleic acid sample 
which comprises: 

25 (a) fragmenting the nucleic acid in the target nucleic acid sample; 

(b) contacting, under hybridizing conditions, the fragmented nucleic acid with a probe 
nucleic acid complementary to the particular nucleic acid sequence of interest, 
wherein said probe nucleic acid, upon hybridization with said particular nucleic 
acid sequence of interest forms a target binding region to which said TBA 
30 specifically binds. 

In this method, the probe nucleic acid, in addition to sequences complementary to said particular 
nucleic acid sequence of interest, also may have additional sequences to which a booster nucleic acid 
can bind to form a booster binding site to which a labeled booster binding assembly can bind to 



W 96/17956 



PCTAJS95/15944 



44 

provide a signal showing and amplifying the binding of the probe nucleic acid to the target nucleic 
acid sequence of interest 

An jKMitimsil aspect of this invention not requiring fragmentation of Target Nucleic Acid, 
involves administration of the TBA to a patient in need of such treatment of a therapeutically or 
5 prophylacticaUy effective amount of said TBA, which comprises administering the TBA, either in 
the farm of a purified protein complex or mtteforoofa recombinant vector wh^ 
the patient is able to express the TBA, such that the TBA binds the particular nucleic acid sequence 
to achieve the desired prophylactic or therapeutic result This may include providing a dosage which 
cro be detenntnod by routine experimentation to be sufficient to prevent establishment of an active 

10 infection by a pathogen. Dosages of purified TBAs may be in the range of about 0.001 to 100 
mg/kg. When provided as a recombinant expression vector which will direct the in vivo expression 
and folding of the TBA, dosages of the recombinant nucleic acid may be substantially lower, 
particularly if provided in the farm of nonpathogenic viral vector. The methods of using the TBAs 
also include monitoring the shift in mobility of nucleic acids in target nucleic acid samples as a 

IS fanctknofthe size such that bn^^ TBA to a particular fragment in the sample modifies the 

mobility of the fragment This aspect of the method provides a useful method of analyzing nucleic 
acid fragments for particular aberrations, such as might be found associated with metastases. 

10. Diagnostic or forensic kits useful in determining the presence of an infection, the 
susceptibility to a disease, or the origin of a particular nucleic acid containing sample. 

20 11. A method of assembling multimcric TBAs in vivo which comprises introducing nucleic 

acids encoding component TBAs into a cell. The component TBAs should each contain a nucleic 
acid recognition unit, assembly sequences, asymmetry sequences, and nuclear localization signal 
sequences. Linker sequences, optionally included if TBA footprinting experiments indicate the need 
for such linkers to attain optimal goometry of the multimeric TBA. Upon in vivo expression of each 

25 component TBA and proximal binding, via the nucleic acid recognition unit of each component TBA 

to nucleic add sequences encountered in the nucleus or elsewhere in the cell, component expressed 
TBAs are directed to assemble via d« included assembly and asymmetry sequences into multimeric 
TBAs. As described above, such multimcric TBAs will have the advantage of binding specifically 
with high affinity to TBRs in a specific target sequence, but not at all or with very low affinity to 

30 cousin nucleic acids. 

The foregoing description of the invention will be appreciated by those skilled in the art to 
enable preferred embodiments as well as the best mode of this invention. Without limiting the 
subject matter to the specifics of the examples provided hereinafter, the following examples are 
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provided to further guide those skilled in the art on methods of practicing this invention. Standard 
recombinant DNA techniques as disclosed in Sambrook, Fritsch, and Maniatis (\9%9) Molecular 
Cloning: A Laboratory Manual, 2nd Ed, Cold Spring Harbor Laboratory Press, Cold Spring 
Harbor, NY, and more recent texts are not disclosed as these are now well within the skill of the 
5 ordinary artisan. 

Example I - Preparation of PNAs and LabeH *iR ?f PNfts 

Probe nucleic acids, PNAs, may be prepared by means well known in the art Thus, single 
stranded polynucleotide PNAs of defined sequence may be prepared via solid phase chemical 

10 synthesis according to Mcnifield PNAs may be prepared by automated synthesis using 
c omm e rci a ll y available technology, such as resins and machines prxxluced or marketed by Applied 
Biosystcms, ABI, or other manufacturers. Alternatively, through known recombinant DNA methods, 
particular PNA sequences are synthesized in vivo, for example by cloning a duplex PNA into a 
vector which can replicate in £. coli 9 large quantities of the duplex PNA may be prepared. 

IS MuttmenoftfaePNAmaybeck^ 

of PNA is liberated upon digestion of the vector with a restriction fragment flanking the PNA 
sequence. Subsequent to synthesis or recombinant production, the PNAs are purified by methods 
well known in die art such as by gd electrophoresis or high pressure liquid chromatography (HPLC). 
If the PNA is produced as a duplex, prior to use in a hybridization assay for detection of target 

20 nucleic acid sequences, the strands of the PNA are separated by heating or other methods known in 

the art 

The specific sequence of bases in the PNA is chosen to reflect the sequence to be detected 
in a TMA, with the proviso that, according to this inventk^ the PNA contains a 1/2 TBR sequence, 
which is one that upon hybridization of the PNA and TNA, a TBR is formed As there are an 
25 essentially unlimited number of such sequences known in the art, the choke of the PNA sequence 

is amenable to selection by the skilled researcher for any given application. The sequence of the HIV 
LTR is one such sequence, which upon hybridization of a PNA encoding portions of the LTR with 
TNAs encoding the HIV LTR, TBRs capable of binding the NF-kB or SP1 DNA binding proteins 
are formed 

30 In addition to sequences which will form a TBR upon hybridization, the PNA also may 

contain a 1/2 BBR This sequence is one which, upon hybridization with a booster nucleic acid, 
BNA, forms a BBR which is capable of binding a BBA. The BBA is preferably a DNA binding 
protein having high affinity for the BBR sequence. 
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In this particular exmnpk, hybridizatto SEQIDNO. 
4 and, at the 3' end of that sequence, a 1/2 BBR sequence shown as SEQ ID NO. 35. The PNA 
encoding these sequences is cither used without labeling or is labeled with a radioactive isotope such 
as P 32 , S S5 , or a similar isotope, according to methods known in the art Alternatively, the PNA is 
5 bound to a bead of betwecnO.OI to 10^ which may be colored for easy visual det^^ This 
label forms the OSA as described in the specification. This probe hybridizes with HIV LTR 
sequences to fonn a TBR that binds NF-kB. In addition, the PNA hybridizes with BNAs having 
a complementary 1/2 BBR to fonn a bacteriophage lambda left operator that binds either cro or 
lambda repressor proteins. 

10 In a manner similar to that described above,PNAs are used wherein die 1/2 TBR is any one 

of SEQ ID NO. 5 or SEQ ID NOS. 7-34, and a 1/2 BBR, such as SEQ ID NO. 35 or SEQ ID NO. 
36iseithaatthc3'endor5'endofthe 1/2 TBR 

Example 2 - Preparation and Labeling of BNAs 

IS Similar to die methods described in Example I for preparation and labeling of PNAs, BNAs 

are prepared and labeled according to methods known in the art. As described in U.S. Patent No. 
4,556,643, herein incorporated by reference (see particularly Example I), nucleic acid sequences 
encoding particular nucleic acid binding sequences may be mass produced by cloning into a 
itplicable vector. Furthermore, similar to that disclosure, the 1/2 TBR and 1/2 BBR sequences may 

20 be co-linearly produced in this fashion, with the distinction, however, that according to the instant 
invention, the 1/2 TBR sequence itself forms a nucleic acid binding component recognition site and 
the 1/2 BBR, while forming a nucleic acid binding component recognition site, also provides a means 
of amplifying the signal produced upon binding of the 1/2 TBR to complementary sequences in the 
TNA by providing for polymerization of BNAs onto the TNA bound PNA. To enable this, a 

25 sequence such as SEQ ID NO. 35, which enoodes the left operator of bacteriophage lambda, is 

provided with additional sequences such that an overhang sequence is created on one or both ends 
of the BNA upon hybridization with the PNA. 

As a specific example, vectorial polymerization of BNAs onto a TNA is provided by SEQ 
ID NOS. 40-43. In this example, SEQ ID NO. 40 encodes two 1/2 TBRs which will hybridize with 

30 two 1/2 TBRs in a TNA to form two NF-kB binding sites, while at the same time providing a 
bacteriophage lambda left operator 1/2 BBR, which additionally is terminated at the 3' end with the 
recognition site for the restriction enzyme Pstl Addition of the BNA, SEQ ID NO. 41, with the 1/2 
BBR complementary to the 1/2 BBR on the PNA, SEQ ID NO. 40, completes the BBR while at the 
same time completing the Pstl recognition site, leaving a four base overhang for hybridization with 
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additional BNAs. Accordingly, SEQ ID NO. 42 is added which h^ 

3 'end which is complementary to the four-base overhang remaining fiom the hybridization of SEQ 
IDNOS. 40ar*14L In addition, SEQ ID NO. 42 is provided with a five base sequence at its 5' end 
which forms part of a BamW recognition site. The growing polymer of BNAs is extended further 
5 byadditicooftheBNA SEQIDNO. 43, which is complementary to SEQ ID NO. 42, completing 

the BBR while at the same time completing the BamK recognition site and leaving a four base 
overhang which may be further hybridized with BNAs having complementary sequences. In this 
fashion, the BNAs may be hybridized extensively so as to greatly amplify the signal of a single PNA- 
TNA hybridization event 

10 As with the PNAs described in Example 1, the BNAs may be used in an unlabeled form or 

may be labeled according to methods known in the art and described in Example 1. It will also be 
appreciated that, rather than produce the BNA polymer by sequential addition of BNAs to the PNA- 
TNA complex, the BNA polymer may be preformed and added directly to the PNA-TNA complex. 
One simple method for preforming such a BNA polymer includes the recombinant production of a 

15 vector in which mubimers of the BNA ait provided with a unique restriction site at either end of the 
polymer. This polymer of BNAs containing multiple BBRs iscut out of the vector and hybridizes 
to a single stranded 1/2 BBR remaining in the PNA upon hybridization of the PNA and the TNA. 
This is accomplished by providing a single stranded sequence in the PNA complementary to an 
overhang produced in the BNA polymer when it is excised fiom the production vector. 

20 

Example 3 - Production of HNAs and Their Use for Capping BNA Potvmers 

The HNAs of this invention are produced according to methods known in the art for 
polynucleotide production as described in Examples 1 and 2 for PNAs and BNAs, In the production 
of the HNAs, however, the soqueoce of the HNA is specifically designed so that a substantial portion 

25 of the HNA forms a self-complementary palindrome to form a hairpin, while at the same time, 
leaving in single stranded form enough bases to be able to hybridize with single stranded sequences 
in the growing chain of BNAs described in Example 2. 

In tins Example, a HNA of S EQ ID NO. 44 is provided to cap the extension of BNAs onto 
the PNA in Example 2 after the addition of the BNA, SEQ ID NO. 43. This is accomplished 

30 because SEQ ID NO. 44, while having a palindromic sequence that forms a stable hairpin, also has 

a sequence at the 5' end of the HNA which completes the BamYR sequence formed by the 
hybridization of SEQ ID NO. 42 and SEQ ID NO. 43. Naturally, termination of the polymer after 
addition of only 3 BNAs is for the purpose of simplicity in demonstrating the invention. As 
described above, this polymerization may be continued essentially indefinitely to amplify the signal 
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of the PNA-TNA hybridization event Once the HNA hybridizes to the growing chain of BNAs, the 
polymer is capped and no further extension of the polymer is possible. 

Example 4 - Preparation of TBAs and BBAs. Labeling, and Immobilization Thereof 
5 The TBAs and BBAs which may be used according to the instant invention include any 

substance which can specifically bind to the TBRs and BBRs formed by hybridization of the PNAs, 
TNAs aad BNAs. Use of DNA binding proteins forms one example of such substances. 

For this example, the TBA is the dimer of the DNA binding portion of p50, and the BBA 
is the lambda cro protein. These proteins may be produced according to methods known in the art 

10 The genes for both of these proteins have been cloned. Thus, these proteins are recombinantly 
produced and purified according to methods known in the art Furthermore, these proteins are 
labeled, either with a radioisotope, such as radioactive iodine, or with an enzyme, such as beta- 
galactosidase or horseradish peroxidase, or with a fluorescent dye such as fluorescein or rfaodamine, 
according to methods well known in die art In addition, either or both of the TBA and BBA may 

15 be immobilized on a solid surface such as the surface of a microtitcr plate or the surface of a bead, 
such as a colored bead of diameter anywhere from 0.01 to 10 |iin. The labels on the TBAs and 
BBAs may be the same or different 

In this example, the TBA containing the dimeric pS 0 DNA binding domain is labeled with 
rhodamine, while the BBA, cro, is labeled with fluorescein. Accordingly, upon hybridization of the 

20 PNAs, TNAs, BNAs and HNAs as described in this patent disclosure and the foregoing and 
fallowing examples, the nucleic acid hybrids, if formed, are contacted with excess labeled TBA and 
cro. The fluorescence of these labels is measured according to known methods and, detection of 
both signals is indicative of the presence of 1/2 TBR sequences in the TNA. The differential signal 
produced by the fluorescence of the NF-kB and cro is a measure of the degree to which the 

25 polymerization of BNAs onto the PNA-TB A hybrid has resulted in amplification of the signal. 
Amplification from one to over a thousand fold is contemplated according to the method of this 
invention. 

Example 5 - Hybridization of two PNAs with a TNA and Discrimination Between a TNA and a 
30 £NA 

The PNAs, PNAI, SEQ ID NO. 40 and PNA2, SEQ ID NO. 45, are used in about ten-fold 
molar excess over the concentration of TNAs in a test sample. For this example, an isolated duplex 
HIV LTR, wherein ne strand of which has the sequence SEQ ID NO. 37, shown in Figure 7, and 
the other strand of which is complementary to the sequence shown in Figure 7, is used as the TNA. 



WO 96/17956 



PC TAJS95/15944 



49 

A duplex isolated CNA is also used in this example, one strand of which has die same sequence as 
SEQIDNO. 37, except that, in the first NF-kB binding site shown in Figure 7, at the center f the 
binding site, position 1 in Figure 7, instead of a "T there is an "A," the complementary strand of 
which therefore mismatches with the SEQIDNO. 40 PNA at that location. 
5 SEQ ID NO. 40 and SEQ ID NO. 45 arc both added to separate reactions, die first 

containing the above described TNA and the second containing the above described CNA. The 
samples arc solubfliznd in an appropriate hybridization buffer, such as 10 mM Tris (pH 7.5), 1 mM 
EDTA. The samples are heated to about 90°C for about five minutes to strand separate the duplex 
TNAs and CNAs in the samples, and then the samples are allowed to cool to allow strands of PNAs, 
10 TNAs and CNAs to anneal. 

Once the hybridization has gciie 
methods such as by calculating the tl/2 based on base compositions and annealing temperature 
according to known methods, the SEQ ID NO. 40 FNA is polymerized by addition of BNAs as in 
Example 2 and the SEQ ID NO. 45 PNA2 probe is polymerized with BNAs starting with Sphl 
15 recognition site overhang. Following addition of the BNAs and a brief hybridization period, the 

separate samples are added to beads coated with covalently immobilized NF-kB , and the NF-kB is 
allowed to bind to any TBRs formed in the TNA and CNA samples. After about 15 minutes of 
binding, the samples are wasted twice with about three volumes of an appropriate washing buffer, 
such as 10 mM Tris, pH 7.5, 1 00 mM NaCt or another buffer pre-determined not to interfere with 
20 NF-kB, or bacteriophage lambda CI repressor protein binding activity. After each wash, the beads 

are allowed to settle under gravity or by brief centrifugatioa This removes any nucleic acids which 
do not have a perfect NF-kB binding site formed by hybridization of the PNA1 and TNA sequences. 

After the final wash, bacteriophage lambda CI repressor protein labeled with a radioactive 
isotope, such as with radioactive iodine, or labeled with an enzyme, such as horseradish peroxidase, 
25 with colored beads, or with a fluorescent label is added to each sample. The samples are thai 

washed several times (about 3) with several volumes (about 2) of an appropriate washing buffer such 
as 10mMTris,pH7.5, 100 mM NaCl, or another buffer pre-determined not to interfere with NF- 
kB, or bacteriophage lambda CI repressor protein binding activity. After each wash, the beads are 
allowed to settle under gravity or by brief centrifugation. Following the last settling or 
30 centrifugation, the bound label is quantitated by detecting the bound radioactivity, liberated color in 

an enzymatic assay, color f bound beads, or fluorescence detection. Alternatively, an anti-CI 
antibody can be added and a standard sandwich enzyme linked immunoassay or radioimmunoassay 
performed to detect bound repressor. In addition, as a negative control (background), all of the 
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foregoing manipulations arc carried out in tandem with a sample in which beads are used having no 
immobilized NF-kB. 

As a result of the foregoing assay, the control and CNA containing samples have similarly 
low signals while the TNA containing sample has a signal well above background 

5 

Example 6 - A Test Kit for the Detecti on of HIV 
A, Kit content s; 

1. Microtiter plate. 

2. 1 mg/mL solution of reoomhinantly produced NF-kB in tris-buffered 
10 saline. 

3. Tube containing single stranded HTV PNAs (a mixture of pre-mixed 
oligonucleotides encoding two NF-kB 1/2 binding sites, i c. a mixture of 
SEQ. ID.Nos.7and8). 

4. Tube containing single stranded human genomic PNA, SEQ IDNO. 1. 
15 5. Tube of nuclease (Art). 

6. Tube of protease. 

7. Tube containing pre-polymerused BNA's, 100 repeat units of 
bacteriophage lambda Or, capped with an HNA but with firte 1/2 BBRs 
available for binding to PNA-TNA hybrids. 

20 8. Tube of horseradish peroxidase (hrp) conjugated cro. 

9. Tube of hip colored substrate. 

10. Tris buffered saline, 100 mL. 

11. Lancet 

12. Reaction tubes A, B, C t each containing 2S0 uL of distilled water. 
25 13. Medicine dropper. 

3, A$g3Vmgth«l; 

(a) The microtiter plate (item 1) is coated with the solution of recombinant^ produced 
NF-kB (item 2) at a concentration of I mg/mL in tris buffered saline overnight at 

30 4 °C with rocking. 

(b) Three drops of blood of the test taker is obtained by pricking a finger with the 
lancet (reagent 1 l) t and a drop of blood is dispensed into each of reaction tubes A, 
B, and C (reagent 12). 
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(c) Into each tube is dispensed one drop of protease solution (reagent 6) with the 
weiring dropper (item 1 2) and the tube agitated and allowed to sit for 5 minutes. 

(d) One drop of nuclease (item 5) is added to each of tubes A-C using the medicine 
dropper and the tubes agitated and allowed to sit for 10 minutes. 

5 (e) One drop of item 3 is added to tube A (test sample); one drop of item 4 is added 

to tube B (positive control); and one drop of saline (item 12) is added to tube C as 
a negative control The tubes art heated to 50°C in hot water and allowed to cool 
to room temperature over ooc hour. 

(f) Whik the hybridization is alk^ 

10 from the surface and the microtiter plate, from step (a), and the plate is rinsed with 

tris buffered saline (tube 10). 

(g) The contents of tubes A-C from step (e) art transferred to three wells of the 
microliter plate and allowed to stand for 1 hour with rocking. 

(h) The microtiter wells containing the contents of tubes A-C are rinsed with tris 
IS buf&rcd saline and emptied. 

(i) One drop of item 7 is added to each well and allowed to hybridize with any 1/2 
BBR sites bound to the plate, over one hour, followed by three rinses with tris 
buffered saline, 

(j) One drop of item 8 is added to each well and cro is allowed to bind to any bound 
20 BNA'sovcr 10 minutes, followed by five, one mL washes with tris-buffoed saline. 

(k) One drop of hip substrate is added to each well and color allowed to develop. 



If wcDs A and B both show odor development, and well C does not, the test is valid and the 
25 subject has been infected with HIV. If only well A shows color development, or if well C shows 

color development, the test has been performed incorrectly, and is invalid If wells A and C show 
no color development but weDB does, the test is valid and the individual has not been infected with 
HIV. 



30 EttfmipTe 7 - Production of Various Novel TBAs 

Novel TBAs for use according to the instant invention are prepared as follows: 
( a) NFkB/NF-kB fHIV-Detcct I) . A nucleic acid encoding any one ofSEQ ID NOS. 63-71 
or a like NF-kB DNA binding protein, is fused, in frame, to a nucleotide sequence encoding an 
assembly sequence, such as cn?, such that the NF-kB DNA recognition sequence is encoded at amino 
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or carboxy tcnninus of the cro sequence . Optionally, a linker sequence is provided between the NF- 
kB sequence and the cro sequence. At the other terminus of cro, a nuclear localization signal 
sequence, such as SEQ ID NO* 72, is optionally provided. Further, asymmetry sequences are 
optionally provided at the cro terminus unused by the NF-kB recognition sequence. Examples of 
5 complete TBAs ait shown below. 

(b) NF-kB/SPl fHIV-Detcct ID . In a similar fashion to that described in (a) above, a 
recombinant coding sequence encoding an NF-kB recognition domain is prepared. In a separate 
construct, instead of SEQ ID NOS, 63*72, the coding sequence for the DNA recognition portion of 
SP1 is included Such a sequence should encode all or a functional part of SEQ ID NO. 73, which 

10 is that portion of the SP1 transcription factor exhibiting DNA binding (see Kadonaga et al [1987] 
Cell 51:1079-1090). The W-kB-encoding vector and the SPl-encoding vector are then co- 
transfected into an appropriate expression system such as is well known in the art. Amonomeric 
NF-kB recognition unit is added to complete die NF-kB recognition dimer after the assembly of the 
SP1 and NF4cB recognition units by 

15 of NF-kB or SPI dimers and direct, instead, the formation of NFkB-SPl beterodimers (i.e., HTV- 
Detect 0), which are then isolated from the expression system (nmmmalian or bacterial cells) by 
known methods. 

(c) SP1/SP1 TBAs ftnV-Dctcct III) . As described in (b) above, an SPl-encoding TBA 
construct is prepared. However, only this construct is transfected into the expression system, and 

20 asymmetry sequences allowing the formation of SPI -SPI dimers are included 

(d) SPI -TATA (HIV-Dctect IV) . As described in (b) above, an SPl-encoding TBA 
recombinam is produced In addition, a recombinant encoding a TBA having the binding sequence, 
SEQ ID NO. 74 r or like sequence encoding a TATA recognition unit is prepared with asymmetry 
sequences complementary to those included in the SPI TBA-encoding construct. These constructs 
yry f^tnmfrrtrrf f*^ t*>» h^tmivmm isolated by standard methods, including affinity purification 
on a DNA column having die appropriate SPI -TATA target binding regions. 

(e) SP1-E2 (HPV-Dctect I) . An SP 1 -encoding construct is prepared as in (b) above. An 
E2 TBA-cncoding construct is prepared by using a sequence encoding any one of SEQ ID NOS. 75- 
84 and 94-98 which are papillomavirus E2 DNA recognition units (see Hegde et al [1992] Nature 
359:505-5 12) or like recognition units, is prepared and co-transformed or cc-transfectcd with the 
SPI TBA-cncoding construct Monomelic E2 recognition unit is added to the complete E2 
recognition dimer after the assembly of the E2-SP1 recognition unit by the chaperone. The 
heterodimer HPV-Detect I is isolated according to known methods. 
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(fi E2-E2 (HPV-Detcct m . As described above in (e), an E2 TBA-eacoding construct is 
prepared, except that asymmetry sequences are included which permit the formation of E2 dimers. 
The expressed dimers arc tira isolated by known methods including affinity for a dimeric E2 binding 
site on a DN A affinity column. 
5 (?) E2-TATA (HFV-Dctoct DP. As described above in (e) and (d) 9 E2 and TATA binding 

TBAs ait prepared (respccbVcjyX except that asymmetry scquenocs are included which enhance the 
formation of he tmxtkr m rather than homodimers. These constructs are then co-expressed and the 
hetcrodimers are isolated. 

OA TATA-TATA fflPV-Dctoct TV). As described above in (a) and (d) f a TATA binding 
10 TBA-eacoding construct is prepared using asymmetry sequences that encourage this hnmoriimcr 
formation and the homodimer is isolated 

(i) Other TBAs . As described above for HIV and HPV TBAs, TBAs for any given 
pathogen or disease state may be produced by identifying specific DNA binding proteins and 
forming an expression construct using appropriate linker, assembly, and asymmetry sequences. 

IS 

]Examplc8 

In a similar fashion to the assay described in Exampk 5, a more stringent assay is produced 
by using the duplex OT-kB-SPl binding protein prepared according to Example 6. Accordingly, the 
probes shown in Figure 7 and used in Example 5 may be lengthened to reduce the inierprobe distance 
20 and thereby reduce the flexibility of the DNA in the TNA. 

Eyamplg 0 - Pro duction of "Hfeh-Ordcr* TBAs 

By the appropriate use of asymmetry sequences, TBAs are produced which are dimers, 
trimcrs, tctrameres, pentaraers, or hexamers of particular DNA recognition units. In this fashion, 

25 a hexameric TBA is produced by making a first NF-kB p50 dimeric TBA using asymmetry 
sequences which enable dimer formation. In addition, the asymmetry sequences enable the 
tetramerization of the p50 dimer with an SP1-SP1 dimer. Finally* additional asymmetry sequences 
direct the bexamerizatioo with a dimer exhibiting nuclear localization sequences. This is 
accomplished by incorporating, for example, asymmetry sequences from insulin, which in nature 

30 forms hexamers. This hexamcr formation is directed by the sequences, SEQ ID NOS. 85 (A) and 
86 (B), 87 (A) and °i (B), 89 (A) and 90 (B) t and 91 (A) and 92 (B) (see Figures 13 and 14), 

Because of the extremely high affinity for the HTV-LTR that can be generated using a 
multimeric TBA, the compounds having this structure and which can be used for this purpose are 
referred to herein as "HTV-Lock" 
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An optimal HIV-Lock is defined by f ootprinting (according to methods well known in the 
rt)TBAs bound to TBRsm the HIV f each DN A binding 

protein contributing to die formation of the multimeric TB A complex is downshifted relative to the 
affinity for any natural target sequence (i.e. CNAs) from which the DNA binding recognition unit 
5 of the TBA is derived Any concomitant loss in binding affinity for the HIV TBRs is more than 
rer tyfn^itwi for »jvm farmattftn of the imiltimer as described below. 

There may be competition between the binding of each component TBA for its TBR and 
assembly, via asymmetry sequences to form the multimer. This is obviated by adjusting the linkers 
between be chapercne and asymmetry sequences in each TBA campooent such that the^ 

10 events are uncoupled The resultant reduction in the dimensionality of diffusion (effective 
concentration increase) for the TBA asymmetry and assembly components results in efficient 
formation of the multimeric complex. 

On the basis of the foortprinting, the length and composition of linkers is adjusted to achieve 
optimal discrimination between target HIV sequences and natural sequences. In this fashion, 

15 ftithmig h «iph mmprwient TBA will have a low affinity for CNA and TBR sequences, the multimeric 

complex will have an extremely high affinity for the now expanded TBR recognized by the 
multimeric complex (the square of the affinity of each TBR recognized by each component TBA of 
the multimeric TBA), while still having a low affinity for CNAs. , In the same fashion, other 
multimeric TBA complexes, aside from HIV-Lock, are prepared. 

20 TBAs which can be fanned in this fashion include the following sequences, which are 

assembled by linking either the protein subunits or nucleic acid sequences encoding these subunits, 
as follows: 

Set Link Sequence s from Groups 

a i+n+ni 
25 b w + v + m 

c iv+m 

wherein groups 1-V consist of sequences selected from: 
Group Selected from Sequences 
I Any of SEQ ID NOS. 85-92 

30 n MctSa, linked to any of SEQ ID NOS 104-106, cadi of which is linked 

toSEQIDNa 99. 

ID SEQ ID NO. 100 linked to any of SEQ ID NOS. 75-84 or 94-98; SEQ ID 

NO. 101 linked to either SEQ ID NO. 74 or SEQ ID NO. 93; or SEQ ID 
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NO. 102 linked to SEQ ID NO. 74 or SEQ ID NO. 93; or any of SEQ ID 
NO. 72, 103, 73, or 63-71. 

IV Any of SEQ ID NOS. 104-106. 

V SEQ ID NO. 99. 

S Specific examples of such TBAs ore SEQ ID NOS. 109-1 16, assembled as follows: 



Sfit 


SEOJDNQ, 




A 


109 


85 + Met Ser + 104 + 99 + 100 + 94 


A 


110 


85+ Met Ser +104 + 99 + 72 


A 


111 


86 + Met Ser + 105 + 99 + 102 + 74 


A 


112 


86 + Met Ser +106 + 99 + 73 


A 


113 


89 + Met Ser +106 + 99 + 63 


C 


114 


106+64 


C 


115 


105+64 


B 


116 


106 + 99 + 73 



15 

In this fashion, choosing between appropriate asymmetry sequences, assembly sequences, 
and DNAroogniticn units, m Furthermore, sets of these, such 

as SEQ ID NOS. 114 and 115, will associate with each other but dimereofSEQ© NO. 114or 115 
will not form due to charge repulsion in the mutated assembly sequences (SEQ ID NO. 104 is cro, 
20 SEQ ID NO. 105 is a novel mutated, negatively charged cro f and SEQ ID NO. 106 is a novel 
mutated, positively charged cro). 

Naturally, given flic amino acid sequence of these TBAs, one of ordinary doll could produce 
recombinant nucleic acid clones encoding these, and such recombinant clones naturally form an 
integral part of this invention. 

25 

Example 10 -HIV Test Using "HIV-LOOT 

In much die same method as used in Example 6 f the "HIV-LOCK" produced according to 
Example 9 is used as the TBA, reagent 2, with similar results. 



30 



Ettmri pfe 1 1 - HIV Test Using "HIV-LOCK" When Testing Blood for Donation 

When the quantity of blood to be tested is not limiting, as when samples of blood for 
deflation are to be tested for HIV contamination, tests similar to Example 6 art run, but for each of 
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tubes A-C, aboul 5 mL of blood is pelleted in a tabletop centrifuge. Other reagents are scaled up as 
necessary to handle the larger quantity of TNA present in the sample. 

Example 12 - "HIV-LOCK" as an Anti-HTV Therapeutic Agent 
5 "HTV-LOCK" produced according to Example 9 is formulated as a 1 mg/mL solution in 

liposomes and injected intravenously into a subject vrix> has been tested and confirmed to be infected 
with HIV. A dose of about 0,1 mg to lOOmgof w HIV-LOCK*7kflo^ 

a twenty-four hour period and the concentration of HIV p24 in the patient's serum monitored The 
treatment is repeated as often as necessary, such as when elevations in the serum p24 occur. 

10 

Example 13 - Use of an HIV-TBA Construct as a Therapeutic 

Arecambii^icftoviralorlike vector is used to deliver a construct encoding an HIV-LTR 
binding TBA to an infected patient The vector encodes a chaperone, such as cro > and sequences 
DNA for binding portions of p50. The same vector also encodes a chaperone on which an SP1 TBA 

IS folds. Asymmetry sequences are provided such that upon co-expression of the p50-TB A and the 
SP1-TBA in a single HIV infected cell in vivo, an immediate association occurs between these 
TBAs, while at the same time preventing any association between the DNA binding portion of pSO 
and endogenous pSO or p65 monomers. NLS sequences are also provided in the TBAs so that, upon 
rfimw formation, the TBA immediately relocates to the nucleus of the cell and binds specifically to 

20 integrated HTV sequences, thus preventing any transcription from that locus. 

For this purpose, it is desirable to select sequences encoding DNA binding domains such 
that the expressed monomers are assembled into a TBA which does not bind to natural human 
sequences. Thus, it is only upon binding of the TBA components to their target sequences that 
association between all components of the TBA occurs to form a complex which tightly and 

25 specifically binds the HTV LTR. 

Example 14 - Diagnostic Test Kit for Human Papillomavims 

This diagnostic for luxmanpapiUocnavzrus takes advantage of the known differential between 
benign and carcinogenic HPV to provide a test which indicates the susceptibility to malignancy in 
30 a patient. The papillomaviruses art a group of small DNA viruses associated with benign squamous 
epithelial cell tumors in higher vertebrates. At least 27 distinct human types of papillomaviruses 
(HPVs) have been found; many of these have been associated with specific clinical lesions. Four f 
these, HPV-6, HPV- 11, HPV- 16, HPV- 18, and HPV-33 have been associated with human genital 
tract lesions. In general, HPV-6 and HPV-1 1 DNAs have been found associated with benign lesions 
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f the genital tract. HPV-16, HPV-18, and HPV-33 have also been found associated with 
prgfrmTignjmt tmA malignimt lesions and art transcribed in most cell lines established from cervical 
carcinomas. HPV-16, HPV-18, and HPV-33 are likely to be only two members of a large set of 
HPV DNAs associated with malignant human cervical carcinomas. 
5 Animal models have shown that benign papillomavirus lesions can progress to m^iign^f 

lesions in the presence of a co-carcinogen. HPV DNA has been found in metastases of cervical 
carcinomas. In malignant cervical lesions, HPV DNA is usually integrated into Ac human genome, 
but there may also be extracfaromosoma] HPV DNA present Integration of HPV to form die 
provirus usually results in the disruption of the viral E2 open reading frame (ORF). Despite 
10 disruption of the E2 ORF, and examination of cell lines from several cervical carcinomas has shown 
transcriptionally active and integrated HPV-16 and HPV-18. When HPV-16 genomes which are 
present in the human cervical carcinoma cell lines SiHa and CaSki have been examined, there are 
differences found in the integration of HPV-16. In the SiHa line, the single HPV-16 genome 
integration occurred Abases 3132 and 3384, disrupting the El and E2 ORFs with a deletion of 0.3 
IS kb. An additional 50-basepair deletion of HPV-16 DNA resulted in the E2 and E4 OFRs being 
fused. The 5' portion of the HPV-16 DNA, consisting of the disrupted E2 ORF, is ligated to 
continuous human right flanking sequences. In addition, a single additional guanine is detected at 
nucleotide 1 138 in the middle of the E I ORF. This basepair addition results in die fusion of the El a 
and Elb ORFs to a single El ORF. 
20 The complete genome of HPV-16 is available on GenBank as accession number K02718; 

the complete genome of HPV-33 is available on GenBank as accession number M12732; the 
complete genome of HPV- 1 8 is available on GenBank as accession number X05015. 

As a preliminary screen, the fact of an HPV infection is established for a given cervical 
biopsy sample by a simple "ye&foo" type of analysis using, for example, any or all of the PN As SEQ 
25 ID NOS. 46-53 and an E2 TBA as described above (U, 9 fragment DNA, binding the PNA, 
immobilize with the TBA, and detect signal with BN As and BBAs). 

Onoe a biopsy sample is found to be positive far HPV r additional information is obtained 
as to the malignancy potential of the HPV by analyzing the integration status of the virus in the 
human genome. 

30 1. Fragment the DNA in the cervical biopsy sample and hybridize to a blocking probe 

having the sequence, SEQ ID NO. 60 This probe will bind to all the fragments in 
the DNA which have not spliced out the 0.3 kb fragment 
2. Expose the DNA in the biopsy sample to a PNA having the sequence, SEQ ID NO. 
61. This probe will only bind to fragments which have deleted the 0.3 kb fragment 
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(the blocking probe will prevent the looping out of the large deletion segments if 
present). 

3. A PNA having SEQ ID NO. 62 is hybridized with SEQ ID NO. 41 to form a BBR 
which will bind to cro or X CI repressor as a BBA, leaving a single-stranded 

5 portion capable of bytsidizmg with the TATA site on SEQ ID NO. 6 1 . This added 

to form a TBR on the 5 ' end of the large deletion. 

4. The TBR is immobilized by a TBA having a TATA binding protein DNA 
recognition unit 

5. The bound fragment are detected by adding BNAs and BBAs as described above. 
1 0 Detection of signal in this assay indicates that die large fragment is deleted in HPV present 

in theTNA. Since this deletion is correlated with malignancy, this assay provides insight into the 
malignancy potential of the HPV infection. This conclusion can be confirmed by performing an 
analogous assay based on the deletion of the 52-basepair fragment which is also correlated with 
HPV-induced malignancy. 

IS The TB P recognition unit used in the TBA for this ass ay may be chosen, for example, from 

a sequence such as SEQ ID NO. 70 or SEQ ID NO. 93. 

Exam ple 15 - RtcopfrfrfPt pTV.y^K™ PmA^n 

Phase One - Preparation of DNA to Produce the HTV-Lock™. In vitro mutagenesis of 

20 the coding regions of the naturally occurring, cloned components of the HIV-Lock™ which need to 
be modified is performed with a MutaGcne Phagemid kit The modified protocol includes the use 
of a Blue-script plasmid containing each of the binding components of HIV-Lock™. These are 
transformed into competent cells and uraril-containing phagemids are grown. Single stranded DNA 
is extracted and used as a template for the mutagenic strand Oligonucleotides containing the desired 

25 mutations, including the incorporation of a novel restriction site, are synthesized and treated with 
polynucleotide kinase and ATP. The kinase treated oligonucleotides are annealed to the single- 
stranded template, and a mutagenic strand is synthesized and ligated according to the MutaGcne 
protocol, with the exception that Sequenase 2.0 provides the polymerase. Libraries are screened 
using both g-"P end-labeled nucleotides containing sequences complementary to the introduced 

30 mutations and by isolating the plasmid DNA and identifying the mutants by the presence of the 
introduced restriction site. The mutations are also confirmed by sequencing with a Sequenase kit. 
The HIV-Lock™ DNA is cloned into the baculovinisexprcssMn system with a p 

Phase Tw - Production of HIV-Lock™ Proteins Using Baculovirus. Sf-9 cells are 
cultured to a pre-detennined density (about 1x10* cells/ml, log phase), infected with the baculovirus 
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cootaining the HIV-Lock™ instructions and harvested to recover the recombinant proteins 
comprising the HIV-Lock 1 *. In the scale-up process, cultures are expanded from flasks to spinners 
and subsequently to bioreactors. Following infection the cells are harvested at 12, 24, 36 and 48 
hours for the protein. Indices of viability are monitored throughout the entire process. 
5 Phase Three - Purification of the HIV-Lock™ Proteins The harvested proteins are first 

separated from particulates by flow-through uttraccntrifiigation to facilitate downstream purification. 
The centrifiiged product is then sterile filtered Extracts are then centrifuged at 40,000 rpm at 4° 
C fir 30 minutes and aliquots are immunopredpttated with polyclonal rabbit antibody against one 
of the HIV-Lock™ components. Iminunoprecipitated proteins are nm on an SDS- 10% PAGE gel. 
10 Phase Four -Test of HlVJx^^Proienu Against HIV DNA Mobility shift assays are 

carried out using an oligonucleotide probe comprising elements of the HIV long terminal repeat and 
fragments containing NFKB binding DNA associated with kappa light chain and microglobulin 
regulation. The oligonucleotide is annealed to its complimentary strand and end-labeled with g- 32 P 
ATP. 

IS Footprinting is acccmpUskd by combining small (Ifr 15 M) of radiolabeled HIV LTR DNA 

with a slightly larger amount of HIV-Lock™ in a buffer at room temperature for 10 minutes. 
Dithiothreitol is added prior to the addition of protein. Iron (II), EDTA, hydrogen peroxide and 
sodium ascevbate are added and the reaction mixture is incubated. A quenching agent is added and 
the products are analyzed suing denaturing gel electrophoresis. This is done for different 

20 concentrations of protein. The resulting gel is imaged using a phosphoimager scanner and the 

resulting high resolution image file is analyzed to abstract the binding affinity of HTV-Lock™for the 
HIV DNA relative to cellular DNA. 

Multiple design and testing iterations may be used in order to refind binding of HIV-Lock™ 
and other TBAs for HIV and other organisms. This process makes it possible to design binding 

25 assemblies such that the binding assembly is not competitive with the wild type proteins for single 

binding sites in Ac genome samples. The development of TBAs for other organisms and TNAs for 
sequences within these organisms can be made using the aforementioned method This method is 
valid when producing binding assemblies for all nucleic acid TBRs including DNA-DNA, DNA- 
RNA and RNA-RNA hybrids and combinations of these hybrids. 

30 

Example 16 - M ethod for identifying nucleic acid binding molecules for production of TBAs and 

BBAsoftofrventipp; 

In the method f this invention, target binding assemblies and booster binding assemblies 
are assembled by identifying nucleic acid binding molecules, and linking the nucleic acid binding 
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portions of the molecules in such a fashion as to achieve TBAs which discriminate between 
particular target sequences and even closely related sequences. One method for identifying the 
nucleic acid binding molecules involves the following steps: 

1. Obtaining a biological sample containing the target nucleic acid. This could be, for 
example, an organism or a tissue extract infected with a pathogen. 

2. Fragmenting the sample so as to expose the nucleic acids and to reduce the size complexity 
of the nucleic acids contained in the sample. 

3. Contacting a first aliquot of the fragmented nucleic acids with a control buffer medium and 
contacting a second aliquot of the fragmented nucleic acids with the control buffer medium 
containing a known profile of nucleic acid binding molecules. 

4. Analyzing the two aliquots to identify fragments which have altered behavior in the aliquot 
contacted with the target binding molecules as opposed to the control aliquot. This is 
accomplished by single dimension gel electrophoresis, two dimension gel electrophoresis, 
high performance liquid chromatography, paper chromatography or any other means which 
reveals a different behavior of the nucleic acid fragments when bound to a nucleic acid 
binding molecule as opposed to when the nucleic acid fragment is unbound 

5 . Identifying and isolating fragments which do exhibit altered behavior when contacted with 
the nucleic acid binding molecule and either sequencing the nucleic acid fragment to 
determine whether known nucleic acid binding molecule motifs are present, or directly 
identifying the nucleic acid binding molecule bound to the nucleic acid. The latter can be 
achieved, far example, by contacting a two dimensional grid of the electrophoresed nucleic 
acids with differentially labeled antibodies which bind to the various nucleic acid binding 
molecules. 

In this method, preferably nucleic acid motifs are used for either diagnostic or therapeutic 
purposes wherein the target nucleic acid has more than a single utilizable nucleic acid binding 
molecule target. In this way, a complex target binding assembly can be generated which takes 
advantage of the proximity of different nucleic acid binding molecular motifs to enhance the 
specificity of the TBA assembled from the individual nucleic acid binding components identified. 
The various nucleic acid binding portions of the nucleic acid binding molecules are then assembled 
into the complete TBAs as described above, for example, for HTV-LOCK™, 

Example 1 7 - Method of identifying specific RNA sequences in a sample 

According to the methods and compositions taught in this invention, any nucleic acid 
sequence can be specifically identified. Identification of target HIV RNA in a sample is achieved 
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by obtaining a sample of a patient's blood or other biological fluid or extract which may contain the 
HIV RNA, and testing for the presence of TAR binding sites. Tat is a positive regulator of HIV 
replication which binds to the TAR region of the HTV RNA. The smallest naturally occurring, fully 
active form of HTV-Tat is 72 amino acids in length, SEQ Id. 1 18 herein. Tat contains at least two 
5 functional domains, and transactivates gene expression from the HIV long terminal repeat (HIV 

LTR). Tat binds to an RNA stem loop structure formed from the self-hybridization of sequences 
mTAR,whidbisjust5*totlKHIVLTR HIV TAR RNA forms a dinucleotide b ulge and two stem- 
loop structures (Rhim et aL 1994 Virology:202, 202-21 1). The Tat (SEQ. Id. 118) binds to this 
structure with lower avidity than does Tat variants wherein AlaS8 is a threonine or where His65 is 
10 an Asp residue. (Derse et al. t 1993 Virology: 194,530-536). Utilizing these facts in the instant 
method is accomplished by: 

1, Fragmenting a biological sample to expose the nucleic acids and reduce the size complexity 

of the nucleic acids. 
2 GxitactmgaTBAwithlhesampkw^ 
15 arxi a proximate flanking sequence in the HTV genome. The TB A used for this purpose is 

assembled on era as die chaperone using Tat as the HIV RNA specific binding molecule. 
To provide specificity such that cross-talk between the HIV TAR site and closely related 
TAR sites which may be present due to such other pathogens as cytomegalovirus, the TBA 
also has an antibody component which recognizes the DNA-RNA hybrid target binding 
20 region formed when a probe nucleic acid binds to the HIV LTR RNA. 

3. Eliminating any "cross-talk" produced by binding of Tat to the TAR region of the HIV 
RNA due to such contaminants (cousin RNAs) as the CMV TAR sequence by contacting 
the reaction with excess Tat variant (either the Ala58 to Thr or the His65 to Asp variants) 
which bind more avidly. In this way, single binding events due to the TBA binding to a 
25 cousin RNAs are competed from the nucleic acid sample by the Tat variant. On the other 

hand, by appropriately selecting the affinity of the double binding achieved as a result of the 
antibody and Tat, the TBA is not displaced from true targets. This process is illustrated in 
figure 16. In another aspect of this same method, the TBA could be one in which, rather 
than using a variant of Tat, an antibody is used which recognizes this nucleic acid segment, 
30 and the TBA used is a double antibody TBA. 

In an alternate version of this method, a probe nucleic acid may be used which hybridizes 
with the HIV LTR RNA Accordingly, a duplex segment of the LTR sp I sites can be created as part 
of the target binding region. This region of the HTV RNA flanks the TAR region which is 5' to the 
LTR but is in close proximity thereto. A TBA containing Tat and two Spl binding units is 
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chaperoned to provide Tat binding to TAR and Spl binding to the Spl binding sites. Amplification 
and detection is then carried out by adding appropriate BNAs, BBAs and HNAs. In yet another 
alteraative, PNAs having Seq. ID. 38 and Scq. ID. 3 9 (see figure 7) could be used. A TB A is used 
which contains one or mare Sp 1 binding units and an antibody unit which binds to the DNA-RNA 
5 hybrid produced fiom sample RNA and the Scq. M. 38 PNA. Appropriate BNAs, BBAs and HNAs 
arc then added to amplify the sigpal. 

Naturally, those skilled in the art will recognize that other TBA and TNA combinations 
could be used to optimize the methods exemplified herein. 



10 It should be understood that the examples and embodiments described herein are for 

illustrative purposes only and that various modifications cr changes ta^ 

to persons skilled in the art and are to be included within the spirit and purview of this application 
and die scope of the appended claims. It will be understood that sequences provided herein art 
exemplary only and that other like sequences suggested by these could 
15 invention. It will also be undenrtood that ahhou^ any segue 

as linear, it could be used in a circularly or otherwise permuted fbnn and although designated as not 
being anti-sense, it could be used in the coding or non-coding form or to bind to coding or non- 
coding complementary sequences. 
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SEQUENCE LISTING 



(1) GENERAL INFORMATIONS 



(i) APPLICANTS 



Applicant Mame(e)s 
Street address: 

city * 
State/Province s 
Countryr 
Postal coda/Zips 
Phone number: 



THE GENE POOL, INC. 

300 Queen Anne Ave* N. , Suite 392 

Seattle 

Washington 

us 

98109-4599 

(206) 526-8617 Fax numbers 



(ii) TITLE OF INVENTIONS METHOD OF DETECTION OF NUCLEIC ACIDS WITH A 
SPECIFIC SEQUENCE COMPOSITION 



(ill) NUMBER OF SEQUENCES S 118 



(iv) CORRESPONDENCE ADDRESS S 

(A) addressees salivanchilt a salivanchik 

(B) STREETS 2421 N.W. 41st St., Suite A~l 

(C) CXTTs Gainesville 

(D) STATES Florida 
<E) COUNTRY* USA 
(F) SIPs 32606 



(V) COMPUTER READABLE FORMS 

(A) MEDIUM TYPE: Floppy disk 

(B) computers IBM PC compatible 

(C) OPERATING SYSTEMS PC-DOS/MS-DOS 

(D) softwares Patent in Release f 1.0, version #1.25 



(vi) CURRENT APPLICATION DATA: 

(A) APPLICATION NUMBER: 

(B) FILING DATES 

(C) CLASSIFICATIONS 



(viii) ATTORNEY/AGENT INFORMATION s 
(A) NAMBs Bencen, Gerard H 
(B> REGISTRATION NUMBER: 35,746 
(C) REFERENCE/DOCKET NUMBER t GP-100.C1 



(ix) TELECOMMUNICATION INFORMATION: 

(A) TELEPHONE: (904) 375-8100 

(B) TELEFAX: (904) 372-5800 



(2) INFORMATION FOR SEQ ID NOsi: 

(1) SEQUENCE CHARACTERISTICS s 

(A) LENGTH: 13 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: both 

(D) TOPOLOGY t lin ar 
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(11) MOLECULE TYPE* CDNA 
(ill) HYPOTHETICAL t NO 
<iv) ANTI -SENSE 1 HO 

(Xi) SEQUENCE DESCRIPTIONS SEQ ID NO: It 
TGGGGATTCC CCA 



(2) INFORMATION FOR SEQ ID NOi2t 

(X) SEQUENCE CHARACTERISTICS 1 

(A) LENGTH: 13 bass pairs 

<B) TYPE: nucleic acid 

(C) STRANDEDNESS: both 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: CDNA 
(iii) HYPOTHETICAL: NO 
(iV> ANTI-SENSE: NO 

(Xi) SEQUENCE DESCRIPTION : SEQ ID NO: 2: 
AAGGGACTTT CCC 



<2) INFORMATION FOR SEQ ID NO: 3: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 13 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS t both 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: CDNA 
(iii) HYPOTHETICAL: NO 
(iv) ANTI -SENSE: NO 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 3: 
AGGGGACTTT CCG 



(2) INFORMATION FOR SEQ ID NO: 4: 

(i) SEQUENCE CHARACTERISTICS X 

(A) LENGTH: 15 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS I both 

(D) T POLOGY: linear 
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(11) MOLECULE TYPE t CDHA 

(iii) HYPOTHETICAL: NO 
(iV) ANTI -SENSE i HO 

(Hi) SEQUENCE DESCRIPTION! SEQ ZD NO: 4: 
GCTGGGGACT TTCCA 15 

(2) INFORMATION FOR SEQ XD NO:5: 

(1) SEQUENCE CHARACTERISTICS S 

(A) LENGTHS 15 base pairs 

(B) TYPfis nucleic acid 

(C) STRANDEDNESS: both 

(D) TOPOLOGY s linear 

(ii) MOLECULE TYPE: CDNA 
(iii) HYPOTHETICAL: NO 

(iv) ANTI -SENSE: NO 

<xi) SEQUENCE DESCRIPTION I SEQ ID NO:5: 
ACAAGGGACT TTCCG 15 



(2) INFORMATION FOR SEQ ID NO: 6: 

(i) SEQUENCE CHARACTERISTICS: 

(A) length: 13 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: both 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: cDNA 
(iii) HYPOTHETICAL: NO 
(iv) ANTI-SENSE: NO 

(Xi) SEQUENCE DESCRIPTION I SEQ ID NO: 6: 
CCGGGTTTTC CCC 13 



(2) INFORMATION FOR SEQ ID NO: 7: 

(i) SEQUENCE CHARACTERISTICS : 

(A) LENGTH: 27 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: both 

(D) topology: linear 
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(ii> MOLECULE TYPE: CDNA 
(iii) HYPOTHETICAL: NO 
(iv) ANTI -SENSES NO 

(Xi) SEQUENCE DESCRIPTIONS SEQ ID KOt7: 
AAGGGACTTT CCGCTOGCGA CTTTCCA 



(2) INFORMATION FOR SEQ ID NO: 8: 

(i) SEQUENCE CHARACTERISTICS l 

(A) LENGTHS 27 base pairs 

(B) types nucleic acid 

(C) strandedness: both 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE I CDNA 
(iii) HYPOTHETICAL: NO 
(iv) ANTI-SENSEs NO 

(Xi) SEQUENCE DESCRIPTION: SEQ ID NOsSt 
AAGGGACTTT CCGCTGGGGA CTTTCCG 



<2> INFORMATION FOR SEQ ID NOt9t 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTHS 26 base pair* 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: both 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: CDNA 
(iii) HYPOTHETICAL: NO 
(iv) ANTI -SENSE: NO 

(Xi) SEQUENCE DESCRIPTION: SEQ ID NO: 9: 
GCTGGGGACT TTCCAGGGAG GCGTGG 



(2) INFORMATION FOR SEQ ID NOtlOt 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 26 bas pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: both 

(D) topology: linear 
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(ii) MOLECULE TYPE l CDNA 
(ill) HYPOTHETICAL: HO 
(iv) ANTI -SENSE: HO 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 10: 
GCTGCGGACT TTCCAGGGGA CGTCTC 

(2) INFORMATION FOR SBQ ID NO tilt 

(i) SEQUENCE CHARACTERISTICS t 

(A) LENGTHt 26 base pairs 

(B) TYPEs nucleic acid 

(C) STRANDEDNESS t both 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE* CDHA 
(iii) HYPOTHETICAL t NO 
(iv) ANTI-SENSE t HO 

<xi) SEQUENCE DESCRIPTION t SEQ ID NO* 11 I 
GCTCOOQACT TTCCGGGGAG CGTGGC 

(2) INFORMATION FOR SEO ID NO: 12 J 

(if SEQUENCE CHARACTERISTICS t 

(A) LENGTHS 26 base pairs 

(B) TYPEt nucleic acid 

(C) STRANDEDNESS: both 

(D) TOPOLOGY: linear 
(ii) MOLECULE TYPE: cDNA 

(iii) HYPOTHETICAL: NO 
(iV) ANT I -SENSE: NO 

(Xi) SEQUENCE DESCRIPTION: SEQ ID NO: 12 1 
GCTGGGGACT TTCCGGOGAG GCGCGG 

(2) INFORMATION FOR SEQ ID NO: 13: 

(i) SEQUENCE CHARACTERISTICS: 
<A) LENGTH: 26 base pairs 
<B> type: nucleic acid 
(C) STRANDEDNESS: both 
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(D) TOPOLOGY t linaar 
(ii> MOLECULE TYPE: CDNA 
(iil) HYPOTHETICAL 5 NO 
(iv) ANT I -SENSE f NO 

(Xi) SEQUENCE DESCRIPTION f SEQ ID NO: 13: 
GCTGGGGACT TTCCAGAGAG GCCTGG 26 

(2) INFORMATION FOR SEQ ID NO: 14: 

(i> SEQUENCE CHARACTERISTICS: 
(A) LENGTH i 26 have pairs 
<B) TYPE i nucleic acid 

(C) STRAMDEDNS8S I both 

(D) TOPOLOGY: linear 
(ii) MOLECULE TYPE: CDNA 

<iii) HYPOTHETICAL : NO 
(iv) ANTI -SENSE: NO 

(Xi) SEQUENCE DESCRIPTION: SEQ ID NO: 14: 
GCTGGGGACT TTCCAGGGGA GGCGTG 26 

(2) INFORMATION FOR SEQ ID NO: 15: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 26 base pa ire 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: both 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: CDNA 
(iii) HYPOTHETICAL: NO 
(iV) ANTI -SENSE: NO 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO 1 15 1 
GCTGGGGACT TTCCAGGGAG GCGTGG 26 

(2) INFORMATION FOR SEQ ID NO: 16: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 26 base pairs 

(B) TYPE: nucleic acid 



WO 96/17996 



69 

(C) STRANDEDNBSS: both 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE* CDNA 
(ill) HYPOTHETICAL t HO 
(IV) ANTI-SENSE: NO 

(Xi) SEQUENCE DESCRIPTION t SEQ ID NOll6t 
GCTGGGGACT TTCCAGGGAG GCTGCC 

(2) INFORMATION FOR SEQ ID HOtl7i 

|i) SEQUENCE CHARACTERISTICS : 

(A) LENGTHS 33 bate pairs 

(B) TYPEi nucleic acid 
(C> STRANDEDNBSS s both 
(D) TOPOLOGY i linear 

(ii) MOLECULE TYPE* CDMA 

(iii) HYPOTHETICAL J HO 

(iv) ANTI -SENSE: HO 

(Xi) SEQUENCE DESCRIPTION : SSQ ID NO: 17: 
TTTCCAGGGA GGCGTGGCCT GGGCGGGACT GGG 

(2) INFORMATION FOR SEQ ID NO: 18: 

(i) SEQUENCE CHARACTERISTICS t 

(A) LENGTH: 33 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNBSS : both 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPES CDNA 
(iii) HYPOTHETICAL: HO 
(iv) ANTI -SENSE: HO 

(3d) SEQUENCE DESCRIPTION: SEQ ID HO: IS: 
CCTGGCCTGG GCGGGACTGG GGAGTGGCGT CCC 

(2) INFORMATION FOR SEQ ID HO: 19: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 45 base pairs 

(B) TYPE: nucl ic acid 



WO 96/1795* 



70 

(C) STRANDEDNESS : both 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPES CDKA 
(lii) HYPOTHETICAL x HO 
(iv) ANT I- SENSE t BO 

(Xi) SEQUENCE DESCRIPTION: SEQ ID NO: 19s 
CTACAAGGGA CTTTCCGCTG GGGACTTTCC AGGGAGGCGT GGCCT 

(2) INFORMATION FOR 8EQ ID NO:20t 

(i) SEQUENCE CHARACTERISTICS t 

(A) LENGTH i 46 baee pairs 

(B) TYPE i nucleic acid 

(C) STRANDEDNESS* both 
<D) TOPOLOGY t linear 

(ii) MOLECULE TYPE: CDNA 

(iii) HYPOTHETICAL: HO 

(iv) ANTI-SENSE: HO 

<xi) SEQUENCE DESCRIPTIONS SEQ ID NO: 20 J 
CAGCAAGGGA CTTTCCGCTG GGGACTTTCC AGGGGAGGTG TGGCCT 

(2) INFORMATION POR SEQ ID NO:21l 

(i) SEQUENCE CHARACTERISTICS t 

(A) LENGTHS 46 base pairs 

(B) TYPEt nucleic acid 

(C) STRANDEDNESS I both 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: CDKA 
(iii) HYPOTHETICAL: NO 
(iv) ANT I -SENSE: NO 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 21: 
CATCAAGGGA CTTTCCGCTG GGGACTTTCC AGGGGAGGTG TGGCCT 

(2) INFORMATION FOR SEQ ID NO: 22: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 46 bas pairs 

(B) TYPE: nucleic acid 
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(C) STRANDEDNESS: both 

(D) topology: lin ar 

(ii) MOLECULE TYPE: CDNA 
(iii) HYPOTHETICAL x HO 
(Lv) ANTI -OTVSEs NO 

(Xi) SEQUENCE DESCRIPTION j SEO ZD NO: 22: 
CAACAAGGGA CTTTCCGCTG GGGACTTTCC AGGGGAGGTG TGGCCT 46 

(2) INFORMATION FOR SEQ ID NO: 23: 

(i) SEQUENCE CHARACTERISTICS t 

(A) LENGTHS 45 base pairs 

(B) TYPEs nucleic acid 

(C) STRANDEDNESS: both 

(D) topology s linear 
(ii) MOLECULE TYPES CDNA 

(iii) HYPOTHETICAL: NO 
(iv) ANTI -SENSE: NO 

(Xi) SEQUENCE DESCRIPTION: SEO ID NO: 23: 
CTACAAGGGA CTTTCCGCTG GGGACTTTCC AGGGAGGCGT GGCAT 45 

(2) INFORMATION FOR SEQ ID NO: 24: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 44 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: both 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: cDNA 
(iii) HYPOTHETICAL: NO 
(iV) ANTI -SENSES NO 

(Xi) SEQUENCE DESCRIPTION: SEQ ID NO:24: 
CTACAAGGGA CTTTCCGCTG GGGACTTTCC GGGGAGCGTG GCCT 44 

(2) INFORMATION FOR SEQ ID NO: 25: 

(i) SEQUENCE CHARACTERISTICS f 
(A) LENGTH: 44 baa pairs 
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(B) TYPE; nucl ic acid 
<C) STRANDEDHESS t both 

<D) TOPOLOGY t linear 

(ii) MOLECULE TYPE* CDNA 
<iii) HYPOTHETICAL: HO 
(iv) ANTI-SENSE: HO 

(3d.) SEQUENCE DESCRIPTION I SEQ ID NO:25: 
CTACAAGGGA CTTTCCGCTG CCGACTTTCC GGGGAGGCGC GGCT 44 



(2) INFORMATION FOR SEQ ID NO:26: 

(i) SEQUENCE CHARACTERISTICS 2 

(A) LENGTH: 45 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDHESS t both 

(D) TOPOLOGY: linear 

(ii> MOLECULE TYPES CDNA 
(iii) HYPOTHETICAL: NO 
(iv) ANTI-SENSE: NO 

(Xi) SEQUENCE DESCRIPTION : SEQ ID NO: 26: 
CTACAAGGGA CTTTCCGCTG GGGACTTTCC AGAGAGGCGT GGACT 45 



(2) INFORMATION FOR SEQ ID NO: 27: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 46 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNBSS J both 
<D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: CDNA 

(iii) HYPOTHETICAL: NO 

(iv) ANTI-SENSE: NO 

(Xi) SEQUENCE DESCRIPTION t SEQ ID NO: 27: 
CTACAAGGGA CTTTCCGCTG GGGACTTTCC AGGGGAGGCG TGGACT 46 



(2) INFORMATION FOR SEQ ID NO:28: 
(i) SEQUENCE CHARACTERISTICS: 
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(A) LENGTH: 46 bas pairs 
<B) TYPE: nucleic acid 

(C) STRANDEDHESS: both 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPES CDNA 
(ill) HYPOTHETICAL S HO 
<iv) AKTI -SENSE t 90 

(Xi) SEQUENCE DESCRIPTION! SBQ ZD N0t2Bt 
CTACAGOGGA CTTTCCGCTG GGGACTTTCC AGGGAGGCGT GGGGAG 46 



(2) INFORMATION FOR SEQ ID HO:29: 

(i) SEQUENCE CHARACTERISTICS t 

(A) lengths 43 base pair* 

(B) types nucleic acid 

(C) STRANDEDNE6S t both 

(D) TOPOLOGY: linear 

<ii> MOLECULE TYPES CDHA 
(iii) HYPOTHETICAL t NO 
(iv) ANTI-SENSE s NO 

(Xi) SEQUENCE DESCRIPTIONS SEQ ID NO! 29: 
CTACAGGGGA CTTTCCGCTC GGGACTTTCC AGGGAGGCTG CCT 43 



(2) INFORMATION FOR SEQ ID NO: 30: 

(i) SEQUENCE CHARACTERISTICS x 

(A) LENGTHS 46 base pairs 

(B) TYPEs nucleic acid 

(C) STRANDSDNESS : both 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: CDNA 
(iii) HYPOTHETICAL: NO 
(iv) ANTI-SENSE: NO 

(Xi) SEQUENCE DESCRIPTIONS SEQ ID NO:30: 
CTTTCCGCTG GGGACTTTCC AGGGAGGCGT GGC CTGGGCG GGACTGGG 48 



(2) INFORMATION F R SEQ ID NO:31: 
(i) SEQUENCE CHARACTERISTICS: 
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(A) length t 45 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : both 

(D) TOPOLOGY: linear 

<ii) MOLECULE TTPE: CDKA 
(iii) HYPOTHETICAL: MO 
<iv> ANTI -SENSE: NO 

(Xi) SEQUENCE DESCRIPTION* SEQ ID NO: 31; 
TTTCCAGGGA GGCGTGGCCT GGGCGGGACT GGGGAGTGGC GTCCC 45 



(2) INFORMATION FOR SEQ ID NO: 32; 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTHS 59 baae pairs 

(B) TYPE i nucleic acid 

(C) STRANDEDNE8S : both 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE I CDKA 
{iii) HYPOTHETICAL : NO 
(iv) ANTI -SENSE: NO 

(Xi) SEQUENCE DESCRIPTION : SEQ ID NO: 32: 
CTACAAGGGA CTTTCCGCTG GGGACTTTCC AGGGAGGCGT GGCCTGGGCG GGACTGGGG 59 



{2) INFORMATION FOR SEQ ID NO: 33: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 59 baae pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS t both 

(D) topology : linear 

(ii) MOLECULE TYPE: CDNA 
(iii) HYPOTHETICAL t NO 
(iv) ANT I -SENSE: NO 

<Xi> SEQUENCE DESCRIPTION 2 SEQ ID NO:33$ 
TTTCCGCTGG GGACTTTCCA gggaggcgtg GCCTGGGCGG GACTGGGGAG TGGCGTCCC 59 



(2) INFORMATION FOR SEQ ID NO: 34: 
(i) SEQUENCE CHARACTERISTICS: 
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(A) LENGTH: 70 bate pairs 

(B) TYPEs nucleic acid 

(C) STRANDEDNESS: both 

(D) TOPOLOGY : linear 

(ii) MOLECULE TYPE l CDMA 
(iii) HYPOTHETICAL l NO 
(iv) ANTI -SENSE: NO 

(xi) SEQUENCE DESCRIPTIONS 8EQ ID NO: 34: 
CTACAAGGGA CTTTCCGCTG GGGACTTTCC AGGGAGGCGT GGCCTGGGCG GGACTGGGGA 60 
GTCGCGTCCC 70 



(2) INFORMATION FOR SEQ ID NO: 35: 

<i) SEQUENCE CHARACTERISTICS t 

(A) lengths 61 base pair* 

(B) types nucleic acid 

(C) strandednesss both 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: cDNA 
(iii) HYPOTHETICAL: NO 
(iv) ANT I -SENSE: NO 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 35: 
TATCACCGCC AGTGGTATTT ATGTCAACAC CGCCAGAGAT AATTTATCAC CGCAGATGGT 60 
T 61 



(2) INFORMATION FOR SEQ ID NO: 36: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 64 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESSS both 

(D) TOPOLOGY: linear 
(ii) MOLECULE TYPE t CDNA 

(iii) HYPOTHETICAL: NO 
(iv) ANTI -SENSE: NO 

(Xi) SEQUENCE DESCRIPTION: SEQ ID NO: 36: 
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TATCACCCCA AGGGATAAAT ATCTAACACC GTGCGTGTTG ACTATTTTAC CTCTGGCGGT 60 
GATA 64 



(2) INFORMATION FOR SEQ ID NO $37 1 

(i) SEQUENCE CHARACTERISTICS : 

(A) LENGTH i 70 base pairs 

(B) TYPEs nucleic acid 

(C) STRANDEONESS : both 

(D) TOPOLOGT t linear 

(ii) MOLECULE TYPE: cDNA 
<iii) HYPOTHETICAL J NO 
<iv) ANTI -SENSE I NO 

(Xi) SEQUENCE DESCRIPTION: SEQ ID NO: 37: 
CTACAAGGGA CTTTCCGCTC GGGACTTTCC AGGGAGGCGT GGCCTGGGCG GGACTGGGGA 60 
GTGGCGTCCC 70 



(2) INFORMATION FOR SEQ ID NO: 38: 

(i) SEQUENCE CHARACTERISTICS I 
<A) LBNGTHt 37 base paira 

(B) TYPE: nucleic acid 

(C) STRANDEDVESS: both 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: cDNA 
(iii) HYPOTHETICAL: NO 
(IV) ANTI -SENSE: NO 

(xi) SEQUENCE DESCRIPTION : SEQ ID NO; 38: 
CTACAAGGGA CTTTCCGCTC GGGACTTTCC AGGGAGG 37 



(2) INFORMATION FOR SEQ ID NO: 39: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 22 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : both 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: CDNA 



(iii) HYPOTHETICAL: NO 
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(iv> ANTI -SENSE t HO 

<xl> SEQUENCE DESCRIPTION: SEO ID NO:39t 
CGGGACTGGG GAGTGGCGTC CC 22 

(2) INFORMATION FOR SEQ ID NO: 40: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 103 base pairs 
<B) TTPEs nucleic acid 

(C) STRANDEDNESS : both 

(D) TOPOLOGY: linear 
(ii) MOLECULE TYPE: cDKA 

(iii) HYPOTHETICAL: NO 
(iv) ANTI -SENSE: NO 

(xi) SEQUENCE DESCRIPTION : SEQ ID NOt 40: 
CTACAAGGGA CTTTCCGCTG GGGACTTTCC AGGGAGGTAT CACCGCCAGT GGTATTTATG 60 
TCAACACCGC CAGAGATAAT TTATCACCGC AGATGGTTCT GCA 103 

<2) INFORMATION FOR SEQ ID NO:41: 

(i) SEQUENCE CHARACTERISTICS I 

(A) LENGTH: 62 base pair ft 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: both 

(D) TOPOLOGY: linear 

(ii) ma 
<iii) HYPOTHETICAL: NO 
<iv) ANTI -SENSE t NO 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO:41: 
GAACCATCTG CGGTGATAAA TTATCTCTGG CGGTGTTGAC ATAAATACCA CTGGCGGTGA 60 
TA 62 



(2) INFORMATION FOR SEQ ID NO: 42: 
(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 71 bas pairs 

(B) TYPE: nucl ic acid 

(C) STRANDEDNESS: both 

(D) TOPOLOGY: linear 



WO 96/17956 



FCT/US9SH5944 



(ii) MOLECULE TYPE J CDKA 



(iii) HYPOTHETICAL: NO 



(iv) ANT I -SENSE: HO 



(xi) SEQUENCE DESCRIPTION: SEO ID NO: 42: 



GATCCAACCA TCTGCGGTGA TAAATTATCT CTGGCGGTGT TGACATAAAT ACCACTGGCG 



60 



GTGATACTGC A 



71 



(2) INFORMATION FOR SEQ ID NO: 43: 

<i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 63 baee pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: both 
<D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: CDNA 

(iii) HYPOTHETICAL: NO 



(iv) ANTI-SENSE: NO 

<xi) SEQUENCE DESCRIPTION: SEQ ID NO: 43: 
GTATCACCGC CAGTGGTATT TATGTCAACA CCGCCAGAGA TAATTTATCA CCGCAGATGG 60 
TTG 63 



(2) INFORMATION FOR SEQ ID NO: 44: 

(i) SEQUENCE CHARACTERISTICS : 

(A) LENGTH: 21 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: both 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: CDNA 
(iii) HYPOTHETICAL: NO 
(iv) ANTI-SENSE: NO 

(Xi) SEQUENCE DESCRIPTION: SEQ ID NO:44: 
GATCCGGGGG GATACCCCCC G 21 



(2) INFORMATION FOR SEQ ID NO: 45: 
(i) SEQUENCE CHARACTERISTICS : 
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(A) LENGTH* 91 baa pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS* both 

(D) TOPOLOGYs linear 

(li) MOLECULE TYPE: CDNA 
(iii) HYPOTHETICAL: BO 
(iv) ANTI -SENSE t NO 

(Xi) SEQUENCE DESCRIPTION! SEQ 10 NO: 45: 
CGGGACTCGG GACTOGCGTC CCTATCACCG GAAGGGATAA ATATCTAACA CCGTGCGTGT 60 
TGACTATTTT ACCTCTOGCG GTGATAGCAT G 91 



(2) INFORMATION FOR SBQ IP NO* 46 3 

(i) SEQUENCE CHARACTERISTICS t 

(A) LENGTH i 53 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS t both 

(D) TOPOLOGY s linear 

(ii) MOLECULE TYPE i CDNA 
(iii) HYPOTHETICAL I NO 
<iv) ANTI-SENSE t NO 

(Xi) SEQUENCE DESCRIPTION: SEQ ID NO* 46: 
CTAAGGGCGT AACCGAAATC GGTTGaACCG AAACCGGTTA GTATAAAAGC AGA 53 



(2) INFORMATION FOR SEQ ID NO: 47: 

fi) SEQUENCE CHARACTERISTICS: 

(A) LENGTHS 54 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS $ both 

(D) TOPOLOGY: linear 

<ii) MOLECULE TYPE: CDNA 
(iii) HYPOTHETICAL: NO 
(iv) ANTI -SENSE: NO 

<xi) SEQUENCE DESCRIPTION : SEQ ID NO: 47: 
AAAAGGGAGT AACCGAAAAC GGTCGGGACC GAAAACGGTG TATATAAAAG ATGT 54 



(2) INFORMATION FOR SEQ ID NO: 48: 
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(i) SEQUENCE CHARACTERISTICS : 

(A) LENGTH : S4 baae pairs 

(B) TYPE* nucleic acid 

(C) STRANDEDNESS: both 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: CDMA 
<iii> HYPOTHETICAL! NO 
(iv) ANTI -SENSES NO 

(Xi) SEQUENCE DESCRIPTION: SEQ ID HO: 48: 
AGTAGGGTGT AACCGAAAGC GGTTCAACCG AAAACGGTGC ATATATAAAG CAAA 54 



(2) INFORMATION FOR SEQ ID NO:49: 
(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTEs 24 base pairs 

(B) TYPEs nucleic acid 

(C) STRANDEDNESS t both 

(D) topology s linear 

(ii) MOLECULE TYPES CDNA 
(iii) HYPOTHETICAL t NO 
(iv) ANTI -SENSES NO 

(xi) SEQUENCE DESCRIPTIONS SEQ ID NO: 49: 
GCTTCAACCG AATTCGGTTG CATC 24 



(2) INFORMATION POR SEQ ZD NO: 50: 

(i) SEQUENCE CHARACTERISTICS: 
(A) LENGTH: 24 base pairs 
<B) TYPE: nucleic acid 
(C) STRANDEDNESS: both 
<D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: CDNA 

(iii) HYPOTHETICAL: NO 

(iv) ANTI -SENSE: NO 

(Xi) SEQUENCE DESCRIPTION: SEQ ID N :50: 
TGTGCAACCG ATTTCGGTTG CCTT 24 



(2) INFORMATION FOR SEQ ID NO:51: 
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(1) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 24 base pairs 

(B) types nucleic acid 

(C) STRANDEDNESS: both 

(D) topology: linear 

(ii) MOLECULE TYPE: CDHA 
(iii) HYPOTHETICAL: NO 
(iv) ANTI -SENSE: NO 

(Xi) SEQUENCE DESCRIPTION: SEQ ID NO: 51: 
TATGCAACCG AAATAGGTTC GGCA 24 



(2) INFORMATION FOR SEQ ID NO: 52: 
(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 24 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNBSS: both 
<D) TOPOLOGY : linear 

(ii) MOLECULE TYPE: cDNA 

(iii) HYPOTHETICAL: NO 

(iv) ANTI-SENSE: NO 

(Xi) SEQUENCE DESCRIPTION i SEQ ID NO: 52: 
TGCCTAACCG TTTTCGGTTA CTTG 24 



(2) INFORMATION FOR SEQ ID NO: 53: 



(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 24 base pairs 

(B) TYPE: nucleic acid 
<C) STRANDEDNBSS : both 
(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: cDNA 

(iii) HYPOTHETICAL: NO 

(iv) ANTI-SENSE: NO 

(Xi) SEQUENCE DESCRIPTION X SEQ ID NO: 53: 



GGACTAACCG TTTTAGGTCA TATT 



24 
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(2) INFORMATION FOR SEQ ID NO: 54: 

(i) SEQUENCE CHARACTERISTICS s 

(A) LENGTH i 52 base pairs 

(B) TYPBs nucleic acid 

<C) STRANDEDNESSs both 
(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE J CDNA 

(iii) HYPOTHETICAL * NO 

(iv) ANTI -SENSES NO 

(Xi) SEQUENCE DESCRIPTION: SEQ ID NOs54t 
GACGACTATC CAGCGACCAA GATCAGAGCC AGACACCGGA AACCCCTCCC AC 52 



<2) INFORMATION FOR SEQ ID NOs55: 

(i) SEQUENCE CHARACTERISTICS I 

(A) LENGTHS 53 base pairs 

(B) TYPEs nucleic acid 

(C) STRANDEDNESSs both 

(D) topology s linear 

(ii) MOLECULE TYPES CDNA 
(iii) HYPOTHETICAL s NO 
(iv) ANTI -SENSES NO 

(XX) SEQUENCE DESCRIPTIONS SEQ ID NO t 55s 
GACGACACGG TATCCGCTAC TCAGCTTGTT AAACAGCTAC AGCACACCCC CTC 53 



(2) INFORMATION FOR SEQ ID NO: 56: 

(i) SEQUENCE CHARACTERISTICS s 

(A) LENGTHS 60 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESSs both 

(D) TOPOLOGY s linear 

(ii) MOLECULE TYPES CDNA 
(iii) HYPOTHETICAL s NO 
(iv) ANTI -SENSE s NO 

(JCi) SEQUENCE DESCRIPTIONS SEQ ID NO: 56: 
GACGACGACC TGCAGACACC ACAGACACCG CCCAGCCCCT TACAAAGCTG TTCTGTGCAG 60 
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(2) INFORMATION FOR SEQ ID NO $57: 

(i) SEQUENCE CHARACTERISTICS t 
(A) LENGTH: 68 base pairs 
<B) TYPE: nucleic acid 

(C) STRANDEDNESSt both 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE* CDNA 
(iii) HYPOTHETICAL: NO 
(iv) ANTI-SENSE: NO 

<xi) SEQUENCE DESCRIPTION: SEQ ID NO: 57; 
CATACCAAAG CCGTCGCCTT GGGCACCGAA GAAACACAAC CACTAAGTTG TTGCACAGAG 60 
ACTCAGTG 68 



(2) INFORMATION FOR SEQ ID NOx58: 

(1) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 77 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDSDNESS : both 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: CDNA 
(iii) HYPOTHETICAL: NO 
(iv) ANTI-SENSE: NO 

(Xi) SEQUENCE DESCRIPTION: SEQ ID NO: 58: 
TAATGTAATT GATTGTAATG ACTCTATGTG CACTACCAGT ACCGTATTCC AGCACCGTGT 60 
CCGTGGGCAC CGCAAAG 77 



(2) INFORMATION FOR SEQ ID NO: 59: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 80 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: both 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: CDNA 
(iii) HYPOTHETICAL: N 
(iv) ANTI-SENSE: NO 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 59: 
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ACAGACAACG ATAACCGACC ACCACAAGCA GCGGCCAAAC ACCCCGCCTT GGACAATAGA 60 
ACA&CACGTA CTGCAACTAA 80 

(2) INFORMATION FOR SEQ ID NO: 60 x 

<i) SEQUENCE CHARACTERISTICS s 

(A) LENGTHS 266 base pairs 

(B) TYPE i nucleic acid 

(C) STRANDEDNESS: both 

(D) TOPOLOGY t linear 

(ii) MOLECULE TYPE: cDNA 
(iii) HYPOTHETICAL: NO 
<iv) ANTI -SENSE : NO 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 60: 
CATATGCAAT ACAATGCATT ATACAAACTG GACACATATA TATATTTGTG AAOAAGCATC 60 
AGTAACTGTG GTAGAGGGTC AAGTTGACTA TTATGGTTTA TATTATGTTC ATOAAGGAAT 120 
ACGAACATAT TTTGTGCAGT TTAAAGATGA TGCAGAAAAA TATAGTAAAA ATAAAGTATG 180 
GGAAGTTCAT GCGGGTGGTC AGGTAATATT ATGTCCTACA TCTGTGTTTA GCAGCAACGA 240 
AGTATCCTCT CCTGAAATTA TTAGGC 266 

(2) INFORMATION FOR SBQ ID NO: 61: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 95 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : both 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: CDNA 
(iii) HYPOTHETICAL: NO 
(lv) ANTI -SENSE: NO 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO:61: 
AGGATGTATA AAAAAACATG GATATACAGT GGAAGTGCAG TTTGATGGAG ACATATGCTA 60 
TTAGGCAGCA CTTGGCCAAC CACCCCGCCG CGACC 95 



(2) INFORMATION FOR SEQ ID NO: 62: 
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(i) SEQUENCE CHARACTERISTICS : 

(A) LENGTH: 81 base pairs 

(B) TYPBs nucleic acid 

<C) STRANDEDNESS: both 
(D) TOPOLOGY: linear 

(ii) MOLECULE TYPES CDNA 

(ill) HYPOTHETICAL: NO 

(iv) ANTI-SENSE: NO 

(xi) SEQUENCE DESCRIPTION : SEQ ID NO: 62: 
CATGTTTTTT TATACATCCA TATCACCGCC AGTGGTATTT ATGTCAACAC CGCCAGAGAT 60 
AATTTATCAC CGCAGATGGT T 81 



(2) INFORMATION FOR SEQ ID NO: 63: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 322 amino acids 

(B) TYPE: amino acid 
(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: peptide 

(iii) HYPOTHETICAL: NO 

<iV) ANTI -SENSE S NO 

(v) FRAGMENT TYPE: internal 
(Xi) SEQUENCE DESCRIPTION: SEQ ID NO: 63: 

Met Ala Asp Asp Asp Pro Tyr Gly Thr Gly Gin Met Phe His Leu Asn 

1 5 10 15 

Thr Ala Leu Thr Bis ser lie Phe Asn Ala Glu Leu Tyr Ser Pro Glu 
20 25 30 

lie Pro Leu Ser Thr Asp Gly Pro Tyr Leu Gin lie Leu Glu Gin Pro 
35 40 45 

Lye Gin Arg Gly Phe Arg Phe Arg Tyr Val Cys Glu Gly Pro Ser His 
50 55 60 

Gly Gly Leu Pro Gly Ala Ser Ser Glu Lys Asn Lys Lys ser Tyr Pro 
65 70 75 80 

Gin Val Lys lie cys Asn Tyr Val Gly Pro Ala Lys Val II Val In 
85 90 95 



Leu Val Thr Asn Gly Lys Asn lie His Leu His Ala His Ser Leu Val 
100 105 110 
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Gly Lys Bis Cys Glu Asp Gly Val Cys Thr Val Thr Ala Gly Pro Lys 
115 120 125 

Asp Met val val Gly Phe Ala Asn Leu Gly He Leu Bis Val Thr Lys 

130 135 140 

Lys Lys Val Phe Glu Thr Leu Glu Ala Arg Met Thr Glu Ala Cys Zle 
145 150 155 160 

Arg Gly Tyr Asn Pro Gly Leu Leu Val Bis 6er Asp Leu Ala Tyr Leu 
165 170 175 

Gin Ala Glu Gly Gly Gly Asp Arg Gin Leu Thr Asp Arg Glu Lys Glu 
180 185 190 

lie He Arg Gin Ala Ala Val Gin Gin Thr Lys Glu Met Asp Leu Ser 
195 200 205 

Val Val Arg Leu Met Phe Thr Ala Phe Leu Pro Asp Ser Thr Gly Ser 
210 215 220 

Phe Thr Arg Arg Leu Glu Pro Val Val Ser Asp Ala Zle Tyr Asp ser 
225 230 235 240 

Lys Ala Pro Asn Ala Ser Asn Leu Lys Zle val Arg Met Asp Arg Thr 
245 250 255 

Ala Gly cys val Thr Gly Gly Glu Glu Zle Tyr Leu Leu Cys Asp Lys 
260 265 270 

Val Gin Lys Asp Asp Zle Gin Zle Arg Phe Tyr Glu Glu Glu Glu Asn 
275 280 285 

Gly Gly val Trp Glu Gly Phe Gly Asp Phe ser Pro Thr Asp Val Bis 
290 295 300 

Arg Gin Phe Ala Zle Val Phe Lys Thr Pro Lys Tyr Lys ABp val Asn 
305 310 315 320 

Zle Thr 



(2) INFORMATION FOR SEQ ZD NOt64t 

(1) SEQUENCE CHARACTERISTICS : 

(A) LENGTH x 325 amino acids 

(B) TYPE: amino acid 
(D) TOPOLOGY: linear 

<ii) MOLECULE TYPE: p ptide 

(iii) HYPOTHETICAL: NO 

<iV) ANT I -SENSE: NO 

(v) fragment TYPE: internal 
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(XX) SEQUENCE DESCRIPTION: SEQ ID NO: 64: 

Met Ala Glu Aap Asp Pro Tyr Leu Gly Arg Pro Glu Gin Met Phe Bii 
15 10 15 

Leu Asp Pro ser Leu Thr Bis Thr lie Phe Asn Pro Glu Val Phe Gin 
20 25 30 



Pro Gin Net Ale Leu Pro Thr Ala Asp Gly Pro Tyr Leu Gin lie Leu 
35 40 45 

Glu Gin Pro Lys Gin Arg Gly Phe Arg Phe Arg Tyr val Cys Glu Gly 
50 55 60 

Pro Ser Bis Gly Gly Leu Pro Gly Ala Ser ser Glu Lys Asn Lys Lys 
65 70 75 80 

Ser Tyr Pro Gin Val Lys He Cys Asn Tyr val Gly Pro Ala Lys val 
85 90 95 

He Val Gin Leu val Thr Asn Gly Lys Asn He Bis Leu Bis Ala Bis 

100 105 110 

ser Leu Val Gly Lys Bis Cys Glu Asp Gly He Cys Thr Val Thr Ala 
115 120 125 

Gly Pro Glu Asp cys Val Bis Gly Phe Ala Asn Leu Gly He Leu Bis 
130 135 140 

Val Thr Lys Lys Lys Val Phe Glu Thr Leu Glu Ala Arg Met Thr Glu 
145 150 155 160 

Ala Cys He Arg Gly Tyr Asn Pro Gly Leu Leu Val Bis Pro Asp Leu 

165 170 175 

Ala Tyr Leu Gin Ala Glu Gly Gly Gly Asp Arg Gin Leu Gly Asp Arg 
180 185 190 

Glu Lys Glu Leu He Arg Gin Ala Ala Leu Gin Gin Thr Lys Glu Met 
195 200 205 

Asp Leu Ser Val Val Arg Leu Met Phe Thr Ala Phe Leu Pro Asp ser 
210 215 220 

Thr Gly Ser Phe Thr Arg Arg Leu Glu Pro Val Val Ser Asp Ala He 
225 230 235 240 

Tyr Asp Ser Lys Ala Pro Asn Ala Ser Asn Leu Lys He Val Arg Met 
245 250 255 

Asp Arg Thr Ala Gly Cys Val Thr Gly Gly Glu Glu He Tyr Leu Leu 
260 265 270 

cys Asp Lys Val In Lys Asp Asp He Gin He Arg Phe Tyr lu Glu 
275 280 285 
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Glu Glu Aon Gly Gly Val Trp Glu Gly Phe Gly Asp Phe 8er Pr Thr 
290 295 300 

Asp val His Arg Gin Phe Ala lie val Phe Lys Thr Pro Lys Tyr Lys 
305 310 315 320 

Asp He Asn lie Thr 

325 



(2) INFORMATION FOR SEQ ZD NO :65s 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 268 amino acids 

(B) TYPE: amino acid 
(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: peptide 

<iii) HYPOTHETICAL t NO 

<iv) ANTI -SENSE: NO 

(v) FRAGMENT TYPE: internal 

(aci) SEQUENCE DESCRIPTION: SSQ ID NO: 65: 

Met Glu Pro Ala Asp Leu Leu Pro Leu Tyr Leu Gin Pro Glu Trp Gly 
15 10 15 

Glu Gin Glu Pro Gly Gly Ala Thr Pro Phe Val Glu lie Leu Glu Gin 
20 25 30 

Pro Lys Gin Arg Gly Met Arg Phe Arg Tyr Lys cya Glu Gly Arg ser 
35 40 45 

Ala Gly ser He Pro Gly Glu His Ser Thr Asp ser Ala Arg Thr His 
50 55 60 

Pro Thr He Arg Val Asn His Tyr Arg Gly Pro Gly Arg Val Arg Val 
65 70 75 60 

ser Leu Val Thr Lys Asp Pro Pro His Gly Pro His Pro Bis Glu Leu 
85 90 95 

Val Gly Arg His CyB Gin His Gly Tyr Tyr Glu Ala Glu Leu ser Pro 
100 105 110 

Asp Arg Ser He Bis ser Phe Gin Asn Leu Gly He Gin cys Val Lys 
115 120 125 

Lys Arg Glu Leu Glu Ala Ala val Ala Glu Arg He Arg Thr Asn Asn 

130 135 140 

Asn Pro Phe Asn Val Pr Met Glu Glu Arg Gly Ala Glu Tyr Asp Leu 
145 150 155 160 
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Ser Ala Val Arg Leu cys Phe Gin val Trp val Asn Gly Pro Gly Gly 
165 170 175 

Leu eye Pro Leu Pro Pro Val Leu Ser Gin Pro lie Tyr Asp Aan Arg 
180 185 190 

Ala Pro Ser Thr Ala Glu Leu Arg lie Leu Pro Gly Aap Arg Asn Ser 
195 200 205 

Gly Ser cya Gin Gly Gly Aep Glu lie Phe Leu Leu eye Aap Lya val 
210 215 220 

Gin Lye Glu Asp lie Glu Val Arg Phe Trp Ala Glu Gly Trp Glu Ala 
225 230 235 240 

Lys Gly ser Phe Ala Ala Ala Asp Val Bis Arg Gin val Ala lie Val 
245 250 255 



Phe Arg Thr Pro Pro Phe Arg Glu Arg ser Leu Arg 
260 265 



(2) INFORMATION FOR SEQ ZD NOt66t 

(1) SEQUENCE CHARACTERISTICS I 

(A) LENGTH t 263 amino acids 

(B) TYPE* amino acid 
(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: peptide 

(iii) HYPOTHETICAL: NO 

(iv) ANTI -SENSES NO 

(v) FRAGMENT TYPE t internal 

(Xi> SEQUENCE DESCRIPTIONS SEQ ID NOs66s 

Met Asp Asp Leu Phe Pro Leu lie Phe Pro Ser Glu Pro Ala Gin Ala 
15 10 15 

Ser Gly Pro Tyr Val Glu lie lie Glu Gin Pro Lys Gin Arg Gly Met 
20 25 30 

Arg Phe Arg Tyr Lys Cys Glu Gly Arg Ser Ala Gly Ser lie Pro Gly 
35 40 45 

Glu Arg Ser Thr Asp Thr Thr Lys Thr Bis Pro Thr lie Lys lie Asn 
50 55 60 

Gly Tyr Thr Gly Pro Cly Thr Val Arg lie Ser Leu Val Thr Lys Asp 
65 70 75 80 
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Pro Pro Bio Arg Pro Bis Pro Bis Glu Leu Val Gly Lys Asp cys Arg 
85 90 95 

Asp Gly Tyr Tyr Glu Ala Asp Leu Cys Pro Asp Arg Ser lie Bis Ser 
100 105 110 

Phe Gin Asn Leu Gly He Gin cys Val Lys Lys Arg Asp Leu Glu Gin 
115 120 125 

Ala lie ser Gin Arg lie Gin Thr Asn Asn Asn Pro Phe Bis val Pro 
130 135 140 

lie Glu Glu Gin Arg Gly Asp Tyr Asp Leu Asn Ala Val Arg Leu Cys 
145 150 155 160 

Phe Gin Val Thr Val Arg Asp Pro Ala Gly Arg Pro Leu Leu Leu Thr 
165 170 175 

Pro val Leu ser Bis Pro lie Phe Asp Asn Arg Ala Pro Asn Thr Ala 
180 185 190 

Glu Leu Lys lie cys Arg val Asn Arg Asn ser Gly ser cys Leu Gly 
195 200 205 

Gly Asp Glu He Phe Leu Leu cys Asp LyB Val Gin Lys Glu Asp He 
210 215 220 

Glu Val Tyr Phe Thr Gly Pro Gly Trp Glu Ala Arg Gly Ser Phe ser 
225 230 235 240 

Gin Ala Asp Val Bis Arg Gin Val Ala He Val Phe Arg Thr Pro Pro 
245 250 255 

Tyr Ala Asp pro ser Leu Gin 
260 



(2) INFORMATION FOR SEQ ID NO: 67: 

(i) SEQUENCE CHARACTERISTICS : 

(A) LENGTH: 263 amino acids 

(B) TYPE: amino acid 
(D) topology: linear 

(ii) MOLECULE TYPE: peptide 

(iii) HYPOTHETICAL: NO 

(iv) ANTI-SENSE* NO 

<v) FRAGMENT TYPE: internal 

(Xi) SEQUENCE DE SCRIPT I N: SEQ ID NO:67: 

M t Asp Glu Leu Phe Pro Leu lie Phe Pro Ala Glu Pro Ala Gin Ala 
15 10 15 
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8 r Gly Pro Tyr Val Clu lie lie Glu Gin Pro Lys Gin Arg Gly Met 
20 25 30 

Arg Phe Arg Tyr Lys eye Glu Gly Arg Ser Ala Gly Ser lie Pro Gly 
35 40 45 

Glu Arg Ser Thr Asp Thr Thr Lys Thr Bie Pro Thr lie Lye tie Aen 
50 55 60 

Gly Tyr Thr Gly Pro Gly Thr val Arg lie ser Leu val Thr Lye Asp 
65 70 75 60 

Pro Pro Bie Arg Pro Bis Pro Bis Glu Leu Val Gly Lys Asp Cys Arg 
85 90 95 

Asp Gly Phe Tyr Glu Ala Glu Leu cys Pro Asp Arg cys He Bis ser 
100 105 110 

Phe Gin Asn Leu Gly lie Gin Cys Val Lys Lys Arg Asp Leu Glu Gin 
115 120 125 

Ala He Ser Gin Arg lie Gin Thr Asn Asn Asn Pro phe Gin val pro 
130 135 140 

He Glu Glu Gin Arg cly Asp Tyr Asp Leu Asn Ala Val Arg Leu cys 
145 150 155 160 



Phe Gin val Thr val Arg Asp Pro ser Gly Arg Pro Leu Arg Leu Pro 
165 170 175 



Pro Val Leu Pro Bis Pro lie Phe Asp Asn Arg Ala Pro Asn Thr Ala 
180 185 190 

Glu Leu Lye Zle Cys Arg Val Asn Arg Asn Ser Gly Ser Cys Leu Gly 

195 200 205 

Gly Asp Glu lie Phe Leu Leu Cys Asp Lys Val Gin Lys Glu Asp He 

210 215 220 

Glu val Tyr Phe Thr Gly Pro Gly Trp Glu Ala Arg Gly Ser Phe ser 
225 230 235 240 

Gin Ala Asp Val Bis Arg Gin Val Ala He Val Phe Arg Thr Pro Pro 
245 250 255 

Tyr Ala Asp Pro Ser Leu Gin 
260 



(2) INFORMATION FOR SEQ ZD NO 1 68 J 

(i) SEQUENCE CHARACTERISTICS t 

(A) LENGTH t 299 amino acids 

(B) TYPE: amino acid 
(D) TOPOLOGY: lin ar 
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(ii) MOLECULE TYPES peptide 
(iii) HYPOTHETICAL I NO 
<iv) ANTI-SENSE x NO 

(v) FRAGMENT TYPEs internal 
(xi) SEQUENCE DESCRIPTIONS SEQ ZD NO: 68: 

Met Phe Pro Asn Gin Asn Asn Gly Ala Ala Pro Gly Gin Gly pro Ala 
15 10 15 

val Asp Gly Gin Gin Sar Leu Asn Tyr Asn Gly Leu Pro Ala Gin Gin 
20 25 30 

Gin Gin Gin Leu Ala Gin Ser Thr Lya Asn Val Arg Lya Lye Pro Tyr 
35 40 45 

Val Lye lie Thr Glu Gin Pro Ala Gly Lye Ala Leu Arg Phe Arg Tyr 
50 55 60 

Glu eye Glu Gly Arg ser Ala Gly ser He Pro Gly val Asn ser Thr 
65 70 75 80 

Pro Glu Asn Lye Thr Tyr Pro Thr He Glu lie Val Gly Tyr Lye Gly 
85 90 95 

Arg Ala Val Val Val Val Ser eye Val Thr Lya Asp Thr Pro Tyr Arg 
100 105 110 

Pro Bis Pro Hie Asn Leu Val Gly Lya Glu Gly cya Lye Lys Gly val 
115 120 125 

cys Thr Leu Glu He Asn ser Glu Thr Met Arg Ala Val Phe ser Asn 
130 135 140 

Leu Gly He Gin Cys Val Lya Lys Lys Asp lie Glu Ala Ala Leu Lye 
145 150 155 160 

Ala Arg Glu Glu He Arg Val Asp Pro Phe Lys Thr Gly Phe Ser His 
165 170 175 

Arg Phe Gin Pro ser ser He Asp Leu Asn Ser Val Arg Leu Cya Phe 
180 185 190 

Gin Val Phe Met Glu Ser Glu Gin Lys Gly Arg Phe Thr Ser Pro Leu 
195 200 205 

Pro Pr val Val Ser Glu Pr H Ph Asp Lys Lys Ala Met ser Asp 
210 215 220 

Leu Val He Cys Arg L u cys s r cys ser Ala Thr val Phe Gly Asn 
225 230 235 240 

Thr Gin He He Leu Leu Cys Glu Lys Val Ala Lys Glu Asp He Ser 
245 250 255 
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val Arg Ph pbe Glu Glu Lys Asn Gly Gin ser Val Trp Glu Ala Phe 
260 265 270 

Gly Asp Phe Gin His Thr Asp val Bis Lys Gin Thr Ala zle Thr Phe 
275 280 285 

Lys Thr Pro Arg Tyr Bis Thr Leu Asp Us Thr 
290 295 



(2) INPORMAT ION FOR SEQ ID NO:69: 

(i) SEQUENCE CHARACTERISTICS x 

(A) LENGTH i 261 amino acids 

(B) TYPE: amino acid 
(D) topology: linear 

(ii) MOLECULE TYPE: peptide 

(iii) HYPOTHETICAL: NO 

(iv) ANTI-SENSE: NO 

(v) FRAGMENT TYPE: internal 

(Xi) SEQUENCE DESCRIPTION: SEQ ID NO: 69: 

Met Asp Phe Leu Thr Asn Leu Arg Phe Thr Glu Gly lie Ser Glu Pro 
15 10 15 

Tyr He Glu He Phe Glu Gin Pro Arg Gin Arg Gly Thr Arg Phe Arg 
20 25 30 

Tyr Lys Cys Glu Gly Arg ser Ala Gly Ser lie Pro Gly Glu His Ser 
35 40 45 

Thr Asp Asn Asn Lys Thr Phe Pro Ser He Gin He Leu Asn Tyr Phe 

50 55 60 



Gly Lys val Lys He Arg Thr Thr Leu val Thr Lys Asn Glu Pro Tyr 
65 70 75 80 

Lys Pro His Pro His Aop Leu Val Gly Lys Gly Cys Arg Asp Gly Tyr 
85 90 95 

Tyr Glu Ala Glu Phe Gly Pro Glu Arg Gin Val Leu Ser Phe Gin Asn 
100 105 110 

Leu Gly lie Gin cys val Lys Lys Lys Asp Leu Lys Glu ser lie ser 
115 120 125 



Leu Arg H Ser Lys Lys Asn Pro phe Asn val Pro Glu Glu Gin Leu 
130 135 140 
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His Asn II Asp Glu Tyr Asp Lou Asn Val Val Arg Leu Cys Pho Gin 
145 ISO 155 160 

Ala Phe Leu Pro Asp Glu His Gly Asn Tyr Thr Leu Ala Leu Pro Pro 
165 170 175 

Leu lie ser Asn Pro He Tyr Asp Asn Arg Ala Pro Asn Thr Ala Glu 
180 185 190 

Leu Arg He cys Arg val Asn Lys Asn cys Gly ser Val Lys Gly Gly 
195 200 205 

Asp Glu He Pho Leu Leu Cys Asp Lys Val Gin Lys Asp Asp He Glu 
210 215 220 

Val Arg Pho Val Leu Gly Asn Trp Glu Ala Lys Gly ser Phe Ser Gin 
225 230 235 240 

Ala Asp Val Bis Arg Gin val Ala He Val Phe Arg Thr Pro Pro Phe 
245 250 255 

Leu Gly Asp He Thr 

260 



(2) INFORMATION FOR SEQ ID NO: 70: 



(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTHS 262 amino acids 

(B) TYPE i amino acid 
(D) TOPOLOGY t linear 

(ii> MOLECULE TYPE t peptide 
(iii) HYPOTHETICAL s NO 
(iv) ANTI-SENSE t NO 

(v) FRAGMENT TYPE: internal 
(Xi) SEQUENCE DESCRIPTION: SEQ ID NO: 70: 

Met Asp Phe Leu Thr Asn Lou Arg Phe Thr Glu Gly He Ser Glu Pro 
15 10 15 

Tyr He Glu He Phe Glu Gin Pro Arg Gin Arg Gly Met Arg Phe Arg 
20 25 30 

Tyr Lys eye Glu Gly Arg ser Ala Gly ser He Pro Gly Glu His ser 
35 40 45 

Thr Asp Asn Asn Lys Thr Phe Pro ser He Gin He Leu Asn Tyr Phe 
50 55 60 

Gly LyB Val Lys II Arg Thr Thr Leu Val Thr Lys Asn Glu Pro Tyr 
65 70 75 80 
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Lys pro His Pro His Asp Leu val Gly Lys Gly eye Arg Asp Gly Tyr 
85 90 95 

Tyr Glu Ala Glu Phe Gly Pro Glu Arg Gin Val Leu Ser Phe Gin Asn 
100 105 110 

Leu Gly Zle Gin Cys Val Lys Lys Lys Asp Leu Lys Glu Ser lie Ser 

115 120 125 

Leu Arg Zle Ser Lys Lys Zle Asn Pro Phe Asn Val Pro Glu Glu Gin 

130 135 140 

Leu His Asn Zle Asp Glu Tyr Asp Leu Asn Val Val Arg Leu Cys Phe 
145 150 155 160 

Gin Ala Phe Leu Pro Asp Glu His Gly Asn Tyr Thr Leu Ala Leu Pro 
165 170 175 

Pro Leu Zle ser Asn Pro Zle Tyr Asp Asn Arg Ala Pro Asn Thr Ala 
180 185 190 

Glu Leu Arg Zle Cys Arg Val Asn Lys Asn Cys Gly Ser Val Lys Gly 
195 200 205 

Gly Asp Glu Zle Phe Leu Leu Cys Asp Lys val Gin Lys Asp Asp Zle 

210 215 220 

Glu Val Arg Phe val Leu Gly Asn Trp Glu Ala Lys Gly ser Phe ser 
225 230 235 240 

Gin Ala Asp Val Bis Arg Gin Val Ala Zle Val Phe Arg Thr Pro Pro 
245 250 255 

Phe Leu Gly Asp Zle Thr 
260 



(2) INFORMATION FOR SEQ ID NOi71: 

(i) SEQUENCE CHARACTERISTICS! 

<A) LENGTH: 314 amino acids 
(8) TYPE: amino acid 
<D) TOPOLOGY : linear 

<ii) MOLECULE TYPE: peptide 

(iii) HYPOTHETICAL I NO 

(iv) ANTI-SENSE t NO 

(v) FRAGMENT TYPE: int rnal 

<xi) SEQUENCE DESCRIPTION t SEQ ID NO: 71: 

Met Ser Asn Lys Lys Gin Ser Asn Arg Leu Thr Glu Gin His Lys Leu 
15 10 15 
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Ser Gin Gly Val II Gly He Phe Gly Asp Tyr Ala Lys Ala Hie Asp 
20 25 30 

Leu Ala val Gly Glu Val Ser Lye Leu Val Lys Lya Ala lieu ser Aan 
35 40 45 . 

Glu Tyr Pro Gin Leu Ser Phe Arg Tyr Arg Asp ser lie Lys Lys Thr 
50 55 60 

Glu He Asn Glu Ala Leu Lys Lys He Asp Pro Asp Leu Gly Gly Thr 

65 70 75 80 

Leu Phe Val Ser Asn Ser Ser He Lys Pro Asp Gly Gly He Val Glu 
85 90 95 

Val Lys Asp Asp Tyr Gly Glu Trp Arg Val Val Leu Val Ala Glu Ala 
100 105 110 

Lys Bis Gin Gly Lys Asp He He Asn He Arg Asn Gly Leu Leu Val 

115 120 125 

Gly Lys Arg Gly Asp Gin Asp Leu Met Ala Ala Gly Asn Ala He Glu 
130 135 140 

Arg ser Bis Asn He ser Glu He Ala Asn Phe Net Leu ser Glu ser 

145 150 155 160 

Bis Phe Pro Tyr Val Leu Phe Leu Glu Gly ser Asn Phe Leu Thr Glu 
165 170 175 

Asn He ser He Thr Arg Pro Asp Gly Arg val val Asn Leu Glu Tyr 
180 185 190 

Asn ser Gly ser Glu ser Bis Phe Pro Tyr Val Leu Phe Leu Glu Gly 
195 200 205 

ser Asn Phe Leu Thr Glu Asn He Ser He Thr Arg Pro Asp Gly Arg 
210 215 220 

Val Val Asn Leu Glu Tyr Asn Ser Gly He Leu Asn Arg Leu Asp Arg 
225 230 235 240 

Leu Thr Ala Ala Asn Tyr Gly Met Pro He Asn Ser Asn Leu Cys He 
245 250 255 

Asn Lys Phe Val Asn Bis Lye Asp Lys Ser lie Met Leu Gin Ala Ala 
260 265 270 

Ser He Tyr Thr Gin Gly Asp Gly Arg Glu Trp Asp Ser Lys He Met 
275 280 285 

Ph Glu II Met Ph Asp II Ser Thr Thr ser Leu Arg val Leu Gly 
290 295 300 



Arg Asp Leu Phe Glu Gin Leu Thr Ser Lys 
305 310 
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(2) INFORMATION FOR SEQ ID NO: 72: 

(i) SEQUENCE CHARACTERISTICS l 

(A) LENGTHS 17 amino acids 

(B) TYFEs amino acid 
(D) TOPOLOGY: linear 

(ii) MOLECULE TYPES peptide 

(iii) HYPOTHETICAL S NO 

(iv) ANTI -SENSES NO 

(v) FRAGMENT TYPE: internal 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 72: 

Cye Asp Thr Asp Aap Arg His Arg He Glu Glu Lye Arg Lye Arg Lye 
15 10 15 

Thr 



(2) INFORMATION FOR SEQ ID NO: 73: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 168 amino acids 

(B) TYPE: amino acid 
(D) TOPOLOGY: linear 

(ii) MOLECULE TYPES peptide 

(iii) HYPOTHETICAL: NO 

(iv) ANTI-SENSE: NO 

(v) FRAGMENT TYPEs internal 
(Xi) SEQUENCE DESCRIPTIONS SEQ ID NO: 73: 

Gly Asp Pro Gly Lys Lys Lys Gin His He Cys His He Gin Gly cys 

15 10 15 

Gly Lys val Tyr Gly Lys Thr Ser His Leu Arg Ala Bis Leu Arg Trp 
20 25 30 

Bis Thr Gly Glu Arg Pro Phe Met Cys Thr Trp Ser Tyr Cys Gly Lys 
35 40 45 

Arg Phe Thr Arg 6 r Asp Glu Leu Gin Arg Bis Lys Arg Thr Bis Thr 
50 55 60 

Gly Glu Lys Lys Phe Ala cys Pr Glu cys Pro Lys Arg Phe Met Arg 
65 70 75 80 
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S r Asp His Leu Ser Lys Bis He Lys Thr Bis Gin Asn Lys Lys Gly 
65 90 95 

Gly Pro Gly Val Ala Leu Ser Val Gly Thr Leu Pro Leu Asp Ser Gly 
100 105 no 

Ala Gly ser Glu Gly Ser Gly Thr Ala Thr Pro ser Ala Leu He Thr 
115 120 125 

Thr Asn Met Val Ala Met Glu Ala He cys Pro Glu Gly He Ala Arg 
130 135 140 

Leu Ala Asn Ser Gly He Asn Val Met Gin Val Ala Asp Leu Gin Ser 
145 150 155 160 

He Asn He ser Gly Asn Gly Phe 
165 



(2) INFORMATION FOR SEQ ID NOt74t 

(1) SEQUENCE CHARACTERISTICS! 

(A) LENGTH: 181 amino acids 

(B) TYPEs amino acid 
(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: peptide 

(iii) HYPOTHETICAL: NO 

(iv) ANTI -SENSE: NO 

(v) FRAGMENT TYPE: internal 
(Xi) SEQUENCE DESCRIPTION t SEQ ID NO: 74: 

ser Gly He val Pro Gin Leu Gin Asn He val ser Thr val Asn Leu 
15 10 i 5 

Gly cys Lys Leu Asp Leu Lys Thr He Ala Leu Arg Ala Arg Asn Ala 
20 25 30 

Glu Tyr Asn Pro Lys Arg Phe Ala Ala Val He Met Arg He Arg clu 
35 40 45 

Pro Arg Thr Thr Ala Leu He Phe 6er Ser Gly Lys Met Val Cys Thr 
50 55 60 

Gly Ala Lys ser Glu Glu Gin Ser Arg Leu Ala Ala Arg Lys Tyr Ala 
*5 70 75 80 

Arg Val Val Gin Lys Leu Gly Phe Pro Ala Lys Phe Leu Asp Phe Lys 
85 90 95 

He Gin Asn Met Val Gly S r Cys Asp Val Lys Phe Pr He Arg Leu 

100 105 no 
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Glu Gly L u Val Leu Thr Bis Gin Gin Ph Ser Ser Tyr Glu Pro Glu 
115 120 125 

Leu Phe Pro Gly Leu lie Tyr Arg Met He Lys Pro Arg He val Leu 
130 135 140 



Leu He Phe Val ser Gly Lys Val Val Leu Thr Gly Ala Lye Val Arg 
145 150 155 160 

Ala Glu He Tyr Glu Ala Phe Glu Asn He Tyr Pro He Leu Lye Gly 
165 170 175 

Phe Arg Lye Thr Thr 
180 



(2) INFORMATION FOR SEQ ID NO:75: 

(i) SEQUENCE CHARACTERISTICS I 

(A) LENGTH i 85 amino acids 

(B) types amino acid 
(D) TOPOLOGY j linear 

(ii) MOLECULE TYPE: peptide 

(iii) HYPOTHETICAL t NO 

(iv) ANTI-SENSE: NO 

(v) FRAGMENT TYPE: internal 

(Xi) SEQUENCE DESCRIPTION x 8EQ ID NO 1 75 1 

Ser eye Phe Ala Leu He Ser Gly Thr Ala Asn Gin val Lys cys Tyr 
15 10 15 

Arg Phe Arg Val Lys Lys Asn His Arg His Arg Tyr Glu Asn Cys Thr 
20 25 30 

Thr Thr Trp Phe Thr Val Ala Asp Asn Gly Ala Glu Arg Gin Gly Gin 
35 40 45 

Ala Gin He Leu He Thr Phe Gly Ser Pro ser Gin Arg Gin Asp Phe 
50 55 60 

Leu Lys Bis Val Pro Leu Pro Pro Gly Net Aen He Ser Gly Phe Thr 
65 70 75 80 

Ala S r L u Asp Phe 
85 



(2) INFORMATION FOR SEQ ID NO t 76s 
(i) SEQUENCE CHARACTERISTICS i 
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(A) LENGTH t 87 amino acids 

(B) TYPE t amino acid 
(D) TOPOLOGY; linear 

(ii) KOLECULE TYPE: peptide 

(iii) HYPOTHETICAL i NO 

(iv) ANN -SENSES NO 

(v) fragment TYPE: internal 

(Xi) SEQUENCE DESCRIPTIONS SEQ ID NO:76i 

cys Pro CyB Leu Leu lie Gly Thr Ser Gly Asn Gly Asn Gin Val Lye 
15 10 15 



CyB Tyr Ser Phe Arg Val Lya Arg Trp Bis Asp Arg Asp Lys Tyr Hie 
20 25 30 

Hia Thr Thr Thr Trp Trp Ala val Gly Gly Gin Gly ser Glu Arg Pro 
35 40 45 

Gly Asp Ala Thr Val lie Val Thr Phe Lys Aap Gin Ser Gin Arg Ser 
50 55 60 

Hi b Phe Leu Gin Gin Val Pro Leu Pro Pro Gly Net Ser Ala His Gly 
65 70 75 80 

Val Thr Met Thr Val Asp Phe 
85 



(2) INFORMATION FOR SEQ ID NO 1 77s 
(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTHS 84 amino acids 

(B) TYPE: amino acid 
(D) TOPOLOGY: linear 

(ii) MOLECULE TYPES peptide 

(iii) HYPOTHETICAL: NO 

(iv) ANTI -SENSE: NO 

(v) FRAGMENT TYPE: internal 
(Xi) SEQUENCE DESCRIPTION: SEQ ID NO: 77: 

Pro Pro Val lie Cys Leu Lye Gly Gly His Aan Gin Leu Lys Cya Leu 
15 10 15 

Arg Tyr Arg Leu Lye Ser Lye His Ser ser Leu Ph Asp Cys II Br 
20 25 30 
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Thr Thr Trp ser Trp Val Asp Thr Thr ser Thr cys Arg Leu cly ser 
35 40 45 

Gly Arg Met Leu lie Lye Phe Ala Asp Ser Glu Gin Arg Aep Lys Phe 
50 55 60 

Leu ser Arg val Pro Leu Pro Ser Thr Thr Gin Val Phe Leu Gly Asn 
65 70 75 80 

Phe Tyr Gly Leu 



(2) INFORMATION FOR SEQ ID NO: 78: 
(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH t 84 amino acid* 

(B) TYPEt amino acid 
(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE i peptide 

(iii) HYPOTHETICAL t NO 
<iv) ANTI -SENSE: NO 

(v) FRAGMENT TYPE: internal 
(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 78: 

Pro Pro val lie Leu Val Arg Gly Gly Ala Asn Thr Leu Lys Cys Phe 
15 10 15 

Arg Asn Arg Ala Arg Val Arg Tyr Arg Gly Leu Phe Lys Tyr Phe Ser 

20 25 30 

Thr Thr Trp Ser Trp val Ala Gly Asp ser Thr Glu Arg Leu Gly Arg 
35 40 45 

Ser Arg Met Leu lie Leu Phe Thr ser Ala Cys Gin Arg Glu Lys Pro 
50 55 60 

Asp Glu Thr Val Lys Tyr Pro Lys Gly val Asp Thr ser Tyr Gly Asn 
65 70 75 80 

Leu Asp ser Leu 



(2) INFORMATION FOR SEQ ID NO:79: 
(i) SEQUENCE CHARACTERISTICS : 

(A) length: 84 amino acids 

(B) TYPE: amino acid 
(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: peptide 
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(iii) HYPOTHETICAL l NO 
(IV) ANTI -SENSE: NO 

(v) FRAGMENT TYPE: internal 
(Xi) SEQUENCE DESCRIPTION: SEQ ID NO: 79 J 

Pro Pro Val Val eye Val Lys Gly Cly Ala Asn Gin Leu Lye eye Leu 

15 10 15 

Arg Tyr Arg Leu Lye Ala ser Thr Gin val Asp Phe Asp Ser He Ser 
20 25 30 

Thr Thr Trp Bis Trp Thr Asp Arg Lys Aen Thr Glu Arg He Gly Ser 
35 40 45 

Ala Arg Met Leu Val Lys Phe He Asp Glu Ala Gin Arg Glu Lya Phe 

50 55 60 

Leu Glu Arg Val Ala Leu Pro Arg Ser Val Ser Val Phe Leu Gly Gin 
65 70 75 80 

Phe Asn Gly ser 



(2) INFORMATION FOR SEQ ID NO 1 80 1 

(i) SEQUENCE CHARACTERISTICS : 

(A) LENGTH: 84 amino acids 

(B) TYPE* amino acid 
(D) TOPOLOGY : linear 

(ii) MOLECULE TYPES peptide 

(iii) HYPOTHETICAL : NO 

(iv> ANTI -SENSE: NO 

(V) FRAGMENT TYPE: internal 

(Xi) SEQUENCE DESCRIPTION: SEQ ID NO: 80: 

Thr Pro He val Gin Leu Gin Gly Asp ser Asn cyB Leu Lys cys Phe 
15 10 15 

Arg Tyr Arg Leu Asn Asp Lys Tyr Lys His Leu Phe Glu Leu Ala ser 
20 25 30 

Ser Thr Trp Bis Trp Ala S r Pro Glu Ala Pro Bis Lys Asn Ala He 
35 40 45 

val Thr Leu Thr Tyr ser ser Glu Glu Gin Arg Gin Gin Phe Leu Asn 
50 55 60 
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Ser val Lya He Pro Pr Thr He Arg His Lys Val Gly phe Met Ser 
65 70 75 80 

Leu Bis Leu Leu 



(2) INFORMATION FOR SEQ ID NO: 81: 
(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH t 84 amino acids 

(B) TYPE: amino acid 
(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: peptide 

(iii) HYPOTHETICAL: HO 

(iv) ANTI -SENSE: NO 

(v) FRAGMENT TYPE: internal 

(Xi) SEQUENCE DESCRIPTION: SEQ ID NO: 61: 

Thr Pro He Val Gin Phe Gin Gly Glu Ser Asn Cys Leu Lys Cys Phe 
15 10 15 



Arg Tyr Arg Leu Ann Arg Asp His Arg Bis Leu Phe Asp Leu He Ser 
20 25 30 

Ser Thr Trp Hie Trp Ala Ser Ser Lya Ala Pro His Lye His Ala He 

35 40 45 

Val Thr val Thr Tyr Asp ser Glu Glu Gin Arg Gin Gin Phe Leu Asp 
50 55 60 

Val Val Lya He Pro Pro Thr He Ser His Lys Leu Gly Phe Met Ser 
65 70 75 80 

Leu His Leu Leu 



(2) INFORMATION FOR SEQ ID NO: 82: 

(i) SEQUENCE CHARACTERISTICS : 

(A) LENGTH: 80 amino acids 

(B) TYPE: amino acid 
(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: peptide 

(iii) HYPOTHETICAL: NO 

(iv) ANTI-SENSE: NO 
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(V) FRAGMENT TYPE: int rnal 
(Xi) SEQUENCE DESCRIPTION: SEQ ZD NO: 82: 

Thr Pro lie lie His Leu Lys Gly Asp Arg Asn ser Leu Lys Cys Leu 
15 10 15 

Arg Tyr Arg Leu Arg Lys Bis Ser Asp Bis Tyr Arg Asp He Ser ser 
20 25 30 

Thr Trp Bis Trp Thr Gly Ala Gly Asn Glu Lys Thr Gly He Leu Thr 
35 40 45 

Val Thr Tyr Bis Ser Glu Thr Gin Arg Thr Lys Phe Leu Asn Thr Val 
50 55 60 

Ala He Pro Asp Ser Val Gin He Leu Val Gly Tyr Asn Thr Met Tyr 
65 70 75 80 

(2) INFORMATION FOR SEQ ID NO: 83$ 

(i) SEQUENCE CHARACTERISTICS t 

(A) LENGTH: 80 amino acids 

(B) TYPE: amino acid 
(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: peptide 

(iii) HYPOTHETICAL: NO 

<iv) ANTI -SENSE: NO 

(v) FRAGMENT TYPE: internal 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 83: 

Thr Pro He Val His Leu Lys Gly Asp Ala Asn Thr Leu Lys cys Leu 
15 10 15 

Arg Tyr Arg Phe Lys Lys His Cys Thr Leu Tyr Thr Ala Val Ser Ser 
20 25 30 

Thr Trp Bis Trp Thr Gly Bis Aen Tyr Lys Bis Lys Ser Ala He Val 
35 40 45 

Thr Leu Thr Tyr Asp Ser Glu Trp Gin Arg Asp Gin Phe Leu ser Gin 
50 55 60 

Val Lye He Pro Lys Thr He Thr Val ser Thr Gly Phe Met Ser He 
65 70 75 SO 

(2) INFORMATION FOR SEQ ID NO: 84: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH t 81 amino acids 
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(B) type: amino acid 
(D) TOPOLOGY : linear 

(ii) MOLECULE TYPE: peptide 

(iii) HYPOTHETICAL : HO 

(iv) ANTI -SENSE: MO 

(v) FRAGMENT TYPEs internal 

(Xi) SEQUENCE DESCRIPTION: SEQ ID NO 1 84: 

Ala Pro lie val Bis Leu Lys Gly Glu ser Aan Ser Leu Lys Cye Leu 
15 10 15 

Arg Tyr Arg Leu Lys Pro Tyr Asn Glu Leu Tyr Ser ser Met ser ser 
20 25 30 

Thr Trp Hie Trp Thr Ser Aap Aan Lys Aan Ser Lye Aan Gly lie val 
35 40 45 

Thr Val Thr Phe val Thr Gly Gin Gin Gin Gin Met Pbe Leu Gly Thr 
50 55 60 

val Lye He pro pro Thr val Gin lie ser Thr Gly phe Met Thr Leu 
65 70 75 80 

Val 



(2) INFORMATION FOR SEQ ID NO: 85: 

(i) SEQUENCE CHARACTERISTICS : 

<A) LENGTH: 21 amino acids 
(B) TYPE: amino acid 
(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: peptide 

(iii) HYPOTHETICAL I NO 

(iv) ANTI -SENSE: NO 

(v) FRAGMENT TYPE: internal 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 85: 

Gly lie Val Glu Gin Cye eye Thr Ser lie Cye Ser Leu Tyr Gin Leu 
15 10 15 

Glu Aen Tyr Cye Aan 
20 



(2) INFORMATION FOR SEQ ID NO: 86: 
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(i) SEQUENCE CHARACTERISTICS : 

(A) LENGTH: 30 amino acids 

(B) TYPE: amino acid 
(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE x peptide 

(iii) HYPOTHETICAL: NO 

(iv) ANTI -SENSE: NO 

(v) FRAGMENT TYPE: internal 

(Xi) SEQUENCE DESCRIPTION : SEQ ID NOt86t 

Phe Val Aen Gin Hi» Leu cya Gly ser Hie Leu val Glu Ala Leu Tyr 

1 5 10 15 

Leu val cys Gly Glu Arg Gly Phe Phe Tyr Thr Pro Lye Thr 
20 25 30 



(2) INFORMATION FOR SEQ ID NO: 87: 

(i) SEQUENCE CHARACTERISTICS : 

(A) LENGTH: 21 amino acida 

(B) TYPE: amino acid 
(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: peptide 

(iii) HYPOTHETICAL: NO 

(iv) ANTI -SENSE: NO 

(v) FRAGMENT TYPE: internal 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 87: 

Gly He Val Glu Gin Cya Cys Ala Ser Val Cys ser Leu Tyr Gin Leu 
15 10 15 

Glu Aen Tyr Cys Aen 
20 



(2) INFORMATION FOR SEQ ID NO: 88: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 30 amino acids 

(B) TYPE: amino acid 
(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: peptid 



(iii) HYPOTHETICAL: NO 
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<iv> ANT I -SENSE: NO 

(v) FRAGMENT TYPE: internal 
(Xi) SEQUENCE DESCRIPTION: SEQ ID NO: 86: 

Phe Val Aan Gin His Leu Cys Gly Ser Bis Leu Val Glu Ala Leu Tyr 
15 10 15 

Leu val eye Gly Glu Arg Gly Phe Phe Tyr Thr Pro Lys Thr 

20 25 30 

(2) INFORMATION FOR SEQ ID NO 1 89: 

(i) SEQUENCE CHARACTERISTICS : 

(A) LENGTH: 24 amino acids 

(B) TYPE: amino acid 
(D) TOPOLOGY: linear 

<ii) MOLECULE TYPE: peptide 

(iii) HYPOTHETICAL: NO 

(iv) ANTI -SENSE : NO 

(v) FRAGMENT TYPE: internal 

(Xi) SEQUENCE DESCRIPTION: SEQ ID NO: 89 : 

Gin Leu Tyr Ser Ala Leu Ala Aen Lys Cys cys His val Gly Cys lie 
15 10 15 

Lys Arg Ser Leu Ala Arg Phe cys 
20 

(2) INFORMATION FOR SEQ ID NO: 90: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 33 amino acids 

(B) TYPE: amino acid 
(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: peptide 

(iii) HYPOTHETICAL: NO 

(iv) ANTI -SENSE: NO 

(v) FRAGMENT TYPE: internal 



(Xi) SEQUENCE DESCRIPTION : SEQ ID NO: 90: 
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Asp ser Trp Met Glu Glu Val lie Lys lie cys Gly Arg Glu Leu Val 
15 10 15 

Arg Ala Gin He Ala He Cys Gly Met Ser Thr Trp Ser Lys Arg Ser 
20 25 30 

Leu 



(2) INFORMATION FOR SEQ ID NO:91: 

(i) SEQUENCE CHARACTERISTICS I 

(A) LENGTH: 24 amino acids 

(B) TYPE: amino acid 
(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: peptide 

(iii) HYPOTHETICAL: NO 

(iv) ANTI -SENSE: NO 

(v) FRAGMENT TYPE: internal 

(Xi) SEQUENCE DESCRIPTION: SEQ ID NO:91: 

Glu Glu Lys Met Gly Thr Ala Lys Lys Cys Cys Ala He Gly Cys Ser 
15 10 15 

Thr Glu Asp Phe Arg Met Val Cys 
20 



(2) INFORMATION FOR SEQ ID NO: 92: 

<i) SEQUENCE CHARACTERISTICS t 

(A) LENGTH: 40 amino acids 

(B) TYPE: amino acid 
(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: peptide 

(iii) HYPOTHETICAL: NO 

(iv) ANTI -SENSE: NO 

(v) FRAGMENT TYPE: internal 

(Xi) SEQUENCE DESCRIPTION: SEQ ID NO: 92: 

Arg Pro Asn Trp Glu Glu Arg Ser Arg Leu Cys Gly Arg Asp Leu He 
15 10 15 

Arg Ala Phe He Tyr Leu Cys Gly Gly Thr Arg Trp Thr Arg Leu Pro 
20 25 30 
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Asn Phe Gly Asn Tyr Pro He Met 
35 40 



(2) INFORMATION FOR SEQ ID NO: 93; 

(i) SEQUENCE CHARACTERISTICS x 

(A) LENGTH: 182 amino acids 

(B) TYPE i amino acid 
(D) TOPOLOGY* linear 

(ii) MOLECULE TYPE: peptide 

(iii) HYPOTHETICAL t NO 

(iv) ANTI-SENSE I NO 

(v) FRAGMENT TYPEs internal 
(Xi) SEQUENCE DESCRIPTIONS SEQ ID NOt93t 

Ser Gly He Val Pro Thr Leu Gin Asn He Val Ser Thr Val Asn Leu 

15 10 15 

Asp Cys Lya Leu Asp Leu Lys Ala He Ala Leu Gin Ala Arg Asn Ala 

20 25 30 

Glu Tyr Asn Pro Lys Arg Phe Ala Ala Val He Met Arg He Arg Glu 
35 40 45 

Pro Lye Thr Thr Ala Leu He Phe Ala ser Gly Lys Met Val Cys Thr 
50 55 60 

Gly Ala Lys ser Glu Asp Phe ser Lys Met Ala Ala Arg Lys Tyr Ala 
65 70 75 80 

Arg He Val Gin Lys Leu Gly Phe Pro Ala Lys Phe Lys Asp Phe Lys 
65 90 95 

He Gin Asn He Val Gly ser Cys Asp Val Lys Phe Pro He Arg Leu 
100 105 110 

Glu Gly Leu Ala Tyr ser Bis Ala Ala Phe ser ser Tyr Glu Pro Glu 
115 120 125 

Leu Phe Pro Gly Leu He Tyr Arg Met Lys val Pro Lys He Val Leu 
130 135 140 

Leu He Phe Val Ser Gly Lys He Val He Thr Gly Ala Lys Met Arg 
145 150 155 160 

Asp Glu Thr Tyr Lys Ala Phe Glu Asn Ho Tyr Pr Val Leu S r Glu 
165 170 175 

Phe Arg Lys He Gin Gin 
180 
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(2) INFORMATION FOR SEQ ZD HO s 94 s 

(i) SEQUENCE CHARACTERISTICS i 

(A) LENGTH t 84 amino acids 

(B) TTPEs amino acid 
(D) TOPOLOGY: linear 

(ii) MOLECULE TYPES peptide 

(iii) HYPOTHETICAL S NO 

(iv) ANTI-SENSE t NO 

(v) FRAGMENT TYPE: internal 

(Xi) SEQUENCE DESCRIPTION z SEQ ID NO:94: 

Ann ser Asn Ser Thr Pro lie Val His Lou Lya Gly Asp Ala Asn Thr 

15 10 15 

Leu Lye cys Leu Arg Tyr Arg Phe Lya Lya His Cys Thr Leu Tyr Thr 
20 25 30 

Ala val ser ser Thr Trp His Trp Thr Gly His Asn Val Lya His Lya 
35 40 45 

Ser Ala He Val Thr Leu Thr Tyr Asp Ser Glu Trp Gin Arg Aap Gin 
50 55 60 

Phe Leu ser Gin Val Lys He Pro Lye Thr He Thr val Ser Thr Gly 
65 70 75 80 

Phe Met ser He 



(2) INFORMATION FOR SEQ ID NO*95: 

(i) SEQUENCE CHARACTERISTICS t 

(A) LENGTHS 84 amino acids 

(B) TYPEs amino acid 
(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: peptide 

<iii) HYPOTHETICAL: NO 

(iv) ANTI -SENSE: NO 

<v) FRAGMENT TYPE: int rnal 
(Xi) SEQUENCE DESCRIPTIONS SEQ ID NO:95: 

Asn Ser Asn Thr Thr Pro He val Bis Leu Lys Gly Asp Ala Asn Thr 
15 10 15 
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Leu Lys cys Leu Arg Tyr Arg phe Lys Lys Bis eye Thr Leu Tyr Thr 

20 25 30 

Ala Val Ser Ser Thr Trp Bis Trp Thr Gly Bis Asn Val Lys Bis Lys 
35 40 45 

Ser Ala lie Val Thr Leu Thr Tyr Asp Ser Glu Trp Gin Arg Asp Gin 
50 55 60 

Phe Leu Ser Gin Val Lys He Pro Lys Thr He Thr Val ser Thr Gly 
65 70 75 60 

Phe Met ser He 



(2) INFORMATION FOR SEQ ID NO:96: 

<1) SEQUENCE CHARACTERISTICS t 

(A) LENGTH t 83 amino acids 

(B) TTPE: amino acid 
(D) TOPOLOGY: linear 

(ii) MOLECULE TTPE J peptide 
(ill) HYPOTHETICAL : HO 
(iv) ANTI -SENSE s NO 

(v) FRAGMENT TYPE: internal 

(Xi) SEQUENCE DESCRIPTION t SEQ ID NOs96x 

ser Gly Asn Thr Thr Pro He He His Leu Lys Gly Asp Arg Asn ser 
15 10 15 

Leu Lys Cys Leu Arg Tyr Arg Leu Arg Lys His Ser Asp His Tyr Arg 
20 25 30 

Asp He Ser Ser Thr Trp His Trp Thr Gly Ala Gly Asn Glu Lys Thr 
35 40 45 

Gly He Leu Thr Val Thr Tyr His ser Glu Thr Gin Arg Thr Lys Phe 
50 55 60 

Leu Asn Thr val Ala He Pro Asp ser val Gin He Leu val Gly Tyr 
65 70 75 80 

Met Thr Net 



(2) INFORMATION FOR SEQ ID NOt97t 
(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 84 amino acids 

(B) TYPE: amino acid 
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(D) TOPOLOGY s linear 
(ii) MOLECULE T?FEi peptide 
(iii) HYPOTHETICAL: NO 
(iV) ANTI-SENSES NO 

(v) FRAGMENT TYPEs internal 
(xi) SEQUENCE DESCRIPTION t SEQ ID NO: 97: 

ser Gly Atn Thr Ala Pro lie val Bis Leu Lys Gly Glu ser Asn ser 

15 10 15 

Leu Lye cys Leu Arg Tyr Arg Leu Lys Pro Tyr Lye Glu Leu Tyr Ser 
20 25 30 



Ser Met ser ser Thr Trp Hie Trp Thr ser Asp Asn Lys Asn Ser Lys 
35 40 45 

Asn Gly He Val Thr Val Thr Phe Val Thr Glu Gin Gin Gin Gin Met 
50 55 60 

Phe Leu Gly Thr Val Lye He Pro Pro Thr Val Gin lie Ser Thr Gly 
65 70 75 SO 

Phe Met Thr Leu 



(2) INFORMATION FOR SEQ ID NO 1 98 1 
(i) SEQUENCE CHARACTERISTICS s 

(A) lengths 89 amino acids 
<B) TYPEs amino acid 
(D) TOPOLOGY s linear 

(ii) MOLECULE TYPE: peptide 

(iii) HYPOTHETICAL: NO 

(iv) ANTI -SENSES NO 

(v) FRAGMENT TYPE: internal 

(Xi) SEQUENCE DESCRIPTION : SEQ ID NO: 98; 

Ser Gly Asn Thr Ser Cys Phe Ala Leu He Ser Gly Thr Ala Aon Gin 
15 10 15 

Val Lys Cys Tyr Arg Ph Arg Val Lye Lys Asn Bis Arg Bis Arg Tyr 
20 25 30 

Glu Asn cys Thr Thr Thr Trp Phe Thr val Ala Asp Asn Gly Ala lu 
35 40 45 
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Arg Cln Cly Gin Ala Gin lie Leu lie Thr Phe Gly ser Pro S r Gin 
50 55 60 

Arg Gin Asp phe Leu Lys His val Pro Leu Pro Pro Gly Met Aen He 

65 70 75 80 

ser Gly Pbe Thr Ala Ser Leu Asp Phe 
85 



(2) INFORMATION FOR SEQ ID NO:99: 

(i) SEQUENCE CHARACTERISTICS S 

(A) LENGTH: 7 amino acids 

(B) TYPE: amino acid 
(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: peptide 

(iii) HYPOTHETICAL: NO 

(iV) ANTI-SENSE t NO 

(v) fragment TYPE: c-terminal 

(Xi) SEQUENCE DESCRIPTION: SEQ ID NO: 99: 

Ser Ann Lys Lys Thr Thr Ala 
1 5 



(2) INFORMATION FOR SEQ ID NO: 100 t 

(i) SEQUENCE CHARACTERISTICS t 

(A) LENGTH: 4 amino acids 

(B) TYPE: amino acid 
(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: peptide 

(iii) HYPOTHETICAL: NO 

(iv) ANTI-SENSE: NO 

(v) FRAGMENT TYPE: internal 

(Xi) SEQUENCE DESCRIPTION: SEQ ID NO: 100: 

Asn Ser Asn Thr 
1 



(2) INFORMATION FOR SEQ ID NO: 101: 

(i) SEQUENCE CHARACTERISTICS: 
(A) length: 4 amino acids 
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(B) TYPE: amin acid 
(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: peptide 

(iii) HYPOTHETICAL: HO 

(iv) ANTI-SENSE: NO 

(v) FRAGMENT TYPE: internal 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 101: 

Ser Gly Asn Thr 
1 

(2) INFORMATION FOR SEQ ID NO: 102 : 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 6 amino acids 

(B) type: amino acid 
(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: peptide 

(iii) HYPOTHETICAL: NO 

(iv) ANN -SENSE: NO 

(v) FRAGMENT TYPE: internal 

(xi) 6EQUENCE DESCRIPTION: SEQ ID NO: 102: 

ser ser Gly ser ser Gly 

1 5 

(2) INFORMATION FOR SEQ ID NO: 103: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: IS amino acids 

(B) TYPE: amino acid 
(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: peptide 

(iii) HYPOTHETICAL: NO 

(iv) ANTI -SENSE: NO 

(v) FRAGMENT TYPE: internal 

(Xi) SEQUENCE DESCRIPTION: SEQ ID N :103: 



Cys Tyr Pro Glu He Lys Asp Lys Glu Glu Val Gin Arg Lye Arg 
15 10 15 
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(2) INFORMATION FOR SEQ ID NO 1 104: 

(1) SEQUENCE CHARACTERISTICS I 

(A) LENGTH t 66 amino acide 

(B) TYPE i amino acid 
(D) TOPOLOGY i linear 

(ii) MOLECULE TYPEs protein 

(iii) HYPOTHETICAL I NO 
<iv) ANTI-SENSES NO 

(v) FRAGMENT TYPEt N -terminal 
(Xi) SEQUENCE DESCRIPTION s SEQ ID NOsl04s 

Met clu Gin Arg lie Thr Leu Lys Asp Tyr Ala Met Arg Phe Gly Gin 
15 10 15 

Thr Lys Thr Ala Lys Asp Leu Gly val Tyr Gin ser Ala lie Asn Lys 

20 25 30 



Ala lie Bis Ala Gly Arg Lys lie Phe Leu Thr lie Asn Ala Asp Gly 
35 40 45 

Ser Val Tyr Ala Glu Glu Val Lys Pro Phe Pro Ser Asn Lys Lys Thr 
50 55 60 

Thr Ala 
65 



(2) INFORMATION FOR SEQ ID NO: 105 S 
(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH i 66 amino acids 

(B) TYPE: amino acid 
(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 

(iii) HYPOTHETICAL: NO 

(iv) ANTI -SENSE: NO 

(v) FRAGMENT TYPE: N -terminal 

(Xi) SEQUENCE DESCRIPTI N: SEQ ID NO: 105: 



Met Glu Gin Glu II Thr Leu Lys Asp Tyr Ala Met Arg Phe Gly Gin 
15 10 15 
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Thr Lya Thr Ala Lys Asp Leu Gly Val Tyr Gin Ser Ala He Asn Lys 

20 25 30 

Ala He Hie Ala Gly Arg Lye He Phe Leu Thr He Asn Ala Asp Gly 
35 40 45 

ser val Tyr Ala Glu Glu val Lys Pro Phe Pro ser Asn Lye Lys Thr 
50 55 60 

Thr Ala 
65 



(2) INFORMATION FOR SBQ ID NO: 106 J 

(1) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 66 amino acids 

(B) TYPE: amino acid 
(D) TOPOLOGY: linear 

(ii) MOLECULE TYPEt protein 

(iii) HYPOTHETICAL: HO 

(iv) ANTI -SENSE: NO 

(V) FRAGMENT TYPE t N-terminal 

(Xi) SEQUENCE DESCRIPTION: SEQ ID NO: 106: 

Met Arg Gin Arg He Thr Leu LyB Asp Tyr Ala Met Arg Phe Gly Gin 
15 10 15 

Thr Lys Thr Ala Lye Asp Leu Gly Val Tyr Gin Ser Ala He Asn Lys 
20 25 30 

Ala He His Ala Gly Arg Lys He Phe Leu Thr He Asn Ala Asp Gly 
35 40 45 

Ser Val Tyr Ala Glu Glu Val Lys Pro Phe Pro Ser Asn Lys Lys Thr 
50 55 60 

Thr Ala 
65 



(2) INFORMATION FOR SEQ ID NO: 107 2 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 96 amino acids 

(B) TYPE: amino acid 
(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: peptid 



(iii) HYPOTHETICAL : NO 
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(iv) ANTI-SENSE t NO 

(V) FRAGMENT TYPE* N -terminal 
(xi) SEQUENCE DESCRIPTIONS SEQ ID NO: 107 l 

ser Thr Lys Lys Lye Pro Leu Thr oln olu Gin Leu Glu Asp Ala Arg 
15 10 15 

Arg Leu Lye Ala He Tyr Glu Lys Lys Lys Asn Glu Leu Gly Leu Ser 

20 25 30 

Gin Glu ser val Ala Asp Lys Met Gly Met Gly Gin ser Gly Val Gly 
35 40 45 

Ala Leu Phe Asn Gly He Asn Ala Leu Asn Ala Tyr Asn Ala Ala Leu 
50 55 60 

Leu Ala Lys He Leu Lys Val Ser Val Glu Glu Phe Ser Pro Ser He 
65 70 75 60 

Ala Arg Glu He Tyr Glu Met Tyr Glu Ala Val Ser Met Glu Pro Ser 
85 90 95 

(2) INFORMATION FOR SEQ ID NO* 108: 
(i) SEQUENCE CHARACTERISTICS t 

(A) LENGTH* 96 amino acids 

(B) TYPE: amino acid 
(D) TOPOLOGY: linear 

<ii) MOLECULE TYPE : peptide 

(iii) HYPOTHETICAL l NO 

(iv) ANTI -SENSE I NO 

(v) FRAGMENT TYPE s N-terminal 

(xi) SEQUENCE DESCRIPTION t SEQ ID NO: 108: 

Ser Thr Lys Lys Lys Pro Leu Thr Gin Glu Gin Leu Glu Asp Ala Arg 
15 10 15 

Arg Leu Lys Ala He Tyr Glu Lys Lyt Lys Asn Glu Leu Gly Leu Ser 
20 25 30 

Gin Glu Ser Val Ala Asp Lys Met Gly Met Gly Gin Ser Gly Val Gly 
35 40 45 

Ala Leu Phe Asn Gly He Asn Ala Leu Asn Ala Tyr Asn Ala Ala Leu 
50 55 60 

Leu Ala Lys He Leu Lys Val s r Val Glu Glu Ph Ser Pro ser He 
65 70 75 80 
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Ala Arg Glu lie Tyr Glu Met Cys Glu Ala Val ser Met Glu Pro Ser 
85 90 95 



(2) INFORMATION FOR SEQ ID NO: 109: 
(1) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 180 amino acids 

(B) TYPE: amino acid 
(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 

(iii) HYPOTHETICAL: NO 

(iv) ANTI-SENSE: NO 

(V) FRAGMENT TYPE: N-tenninal 

(Xi) SEQUENCE DESCRIPTION: SEQ ID NO: 109: 

Gly He Val Glu Gin cys Cys Tbr ser He cys ser Leu Tyr Gin Leu 

1 5 10 15 

Glu Asn Tyr Cys Ann Met Ser Met Glu Gin Arg He Thr Leu Lya Asp 
20 25 30 

Tyr Ala Met Arg Phe Gly Gin Thr Lya Thr Ala Lys Asp Leu Gly Val 
35 40 45 

Tyr Gin ser Ala He Asn Lys Ala He Bis Ala Gly Arg Lys lie Phe 

50 55 60 

Leu Thr He Asn Ala Asp Gly Ser Val Tyr Ala Glu Glu Val Lys Pro 
65 70 75 80 

Phe Pro ser Asn Lys Lys Thr Thr Ala ser Aan Lys Lys Thr Thr Ala 
85 90 95 

Asn Ser Aen Thr Thr Pro He val His Leu Lys Gly Asp Ala Asn Thr 
100 105 110 

Leu Lys Cys Leu Arg Tyr Arg Phe Lys Lys His Cys Thr Leu Tyr Thr 
115 120 125 

Ala Val Ser Ser Thr Trp His Trp Thr Gly Bis Asn Val Lys Bis Lys 
130 135 140 

Ser Ala He Val Thr Leu Thr Tyr Asp Ser Glu Trp Gin Arg Asp Gin 
145 150 155 160 

Phe Leu ser Gin Val Lys He Pro Lys Thr He Thr Val Ser Thr Gly 
165 170 175 

Ph Met Ser He 
180 



WO 96/17956 



PCT/US95/15944 



119 

(2) INFORMATION FOR SEQ ID NOtllOt 

(i) SEQUENCE CHARACTERISTICS : 

(A) LENGTH s 113 amino acida 

(B) TYPE: amino acid 
(D) topology i linear 

(ii) MOLECULE TTPEt protein 

(iii) HYPOTHETICAL* NO 
(iv) ANTI -SENSE t NO 
(v) FRAGMENT TYPEs N-terminal 

(Zi) SEQUENCE DESCRIPTION z SEQ ID NOtllOt 

Gly lie Val Glu Gin Cya Cya Thr sar He cya Bar Leu Tyr Gin Leu 

15 10 15 

Glu Aan Tyr Cys Aan Mat Ser Met Glu Gin Arg Ha Thr Leu Lya Aap 

20 25 30 

Tyr Ala Met Arg phe Gly Gin Thr Lya Thr Ala Lya Aap Leu Gly Val 
35 40 45 

Tyr Gin ser Ala He Aan Lya Ala lie His Ala Gly Arg Lya He Phe 

50 55 60 

Leu Thr He Aan Ala Aap Gly Ser Val Tyr Ala Glu Glu Val Lya Pro 
65 70 75 80 

Phe Pro Ser Aan Lya Lya Thr Thr Ala ser Aan Lya Lya Thr Thr Ala 
65 90 95 

Cya Aap Thr Aap Aap Arg His Arg He Glu Glu Lys Arg Lya Arg Lya 

100 105 110 

Thr 



(2) INFORMATION FOR SEQ ID NOtlllt 

(i) SEQUENCE CHARACTERISTICS S 

(A) LENGTH: 292 amino acids 
<B) TYPE s amino acid 
(D) TOPOLOGYs linear 

(ii) MOLECULE TYPE: protein 

(iii) HYPOTHETICAL : NO 

(iv) ANTI -SENSE: NO 

(v) fragment TYPEs N-terminal 
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(Xi) SEQUENCE DESCRIPTION: SEQ ID NOlllll 

Phe val Asn Gin Bis Leu CyB Gly ser Bia Leu Val Glu Ala Leu Tyr 
15 10 15 

Lou val Cya Gly Glu Arg Gly Phe Phe Tyr Thr Pro Lys Thr Met ser 
20 25 30 

Met Glu Gin Glu lie Thr Leu Lye Asp Tyr Ala Met Arg Phe Gly Gin 
35 40 45 

Thr Lya Thr Ala Lye Aap Leu Gly val Tyr Gin ser Ala He Aan Lys 

50 55 60 

Ala He Bie Ala Gly Arg Lys He Phe Leu Thr He Aan Ala Aap Gly 
65 70 75 60 

Ser val Tyr Ala Glu Glu Val Lya Pro Phe Pro ser Aan Lya Lya Thr 
85 90 95 

Thr Ala Ser Aan Lya Lys Thr Thr Ala Ser Ser Gly Ser Ser Gly Ser 
100 105 110 

Gly He Val Pro Gin Leu Gin Aan He val ser Thr val Aan Leu Gly 

115 120 125 

Cya Lys Leu Aap Leu Lya Thr He Ala Leu Arg Ala Arg Aan Ala Glu 
130 135 140 

Tyr Aan Pro Lya Arg Phe Ala Ala Val He Met Arg He Arg Glu Pro 
145 150 155 160 

Arg Thr Thr Ala Leu He Phe ser Ser Gly Lya Met val Cya Thr Gly 
165 170 175 

Ala Lys ser Glu Glu Gin Ser Arg Leu Ala Ala Arg Lya Tyr Ala Arg 
180 185 190 

Val Val Gin Lys Leu Gly Phe Pro Ala Lys Phe Leu Aap Phe Lys He 
195 200 205 

Gin Asn Met val Gly ser eye Asp val Lys Phe Pro He Arg Leu Glu 
210 215 220 

Gly Leu Val Leu Thr Bis Gin Gin Phe Ser Ser Tyr Glu Pro Glu Leu 
225 230 235 240 

Phe Pro Gly Leu He Tyr Arg Met He Lys Pro Arg He Val Leu Leu 
245 250 255 



He Phe Val ser Gly Lys val Val Leu Thr Gly Ala Lys Val Arg Ala 
260 265 270 



Glu II Tyr Glu Ala Phe Glu Asn He Tyr Pro He Leu Lys Gly Phe 
275 280 285 
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Arg Lys Thr Thr 

290 



(2) INFORMATION FOR SEO ID HO: 112: 

(1) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 273 amino acids 

(B) TYPEs amino acid 
(D) TOPOLOGY t linear 

(ii) molecule types protein 

(iii) HYPOTHETICAL I NO 
(iv) ANTI-SENSE: NO 

(V) FRAGMENT TYPE i N-terminal 
(Xi) SEQUENCE DESCRIPTIONS SEQ ID NO: 112: 

Phe Val Asn Gin His Leu Cys Gly Ser His Leu Val Glu Ala Leu Tyx 
15 10 15 

Leu Val cys Gly Glu Arg Gly Phe Phe Tyr Thr Pro Lys Thr Met ser 
20 25 30 

Met Arg Gin Arg lie Thr Leu Lys Asp Tyr Ala Met Arg Phe Gly Gin 
35 40 45 

Thr Lys Thr Ala Lys Asp Leu Gly Val Tyr Gin Ser Ala lie Asn Lys 

50 55 60 

Ala lie His Ala Gly Arg Lys lie Phe Leu Thr lie Asn Ala Asp Gly 
65 70 75 60 

ser val Tyr Ala Glu Glu val Lys Pro Phe Pro ser Asn Lys Lys Thr 
85 90 95 

Thr Ala ser Asn Lys Lys Thr Thr Ala Gly Asp Pro Gly Lys Lys Lys 

100 105 110 

Gin His lie cys Bis lie Gin Gly Cys Gly Lys Val Tyr Gly Lys Thr 
115 120 125 

ser Bis Leu Arg Ala His Leu Arg Trp Bis Thr Gly Glu Arg Pro Phe 

130 135 140 

Met Cys Thr Trp ser Tyr cys Gly Lys Arg Phe Thr Arg ser Asp Glu 
145 150 155 160 

Leu Gin Arg Bis Lys Arg Thr Bis Thr Gly Glu Lys Lys Phe Ala cys 
165 170 175 

Pro Glu cys Pro Lys Arg Ph Met Arg Ser Asp His Leu Ser Lys Bis 
180 185 190 
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Zl Lya Thr His Gin Asn Lys Lys Gly Gly Pro Gly val Ala Leu ser 

19S 200 205 

Val Gly Thr Leu Pro Leu Asp ser Gly Ala Gly ser Glu Gly Ser Gly 
210 215 220 

Thr Ala Thr Pro ser Ala Leu lie Thr Thr Asn Met Val Ala Met Glu 
225 230 235 240 

Ala Zle eye Pro Glu Gly He Ala Arg Leu Ala Asn Ser Gly lie Asn 
245 250 255 

Val Met Gin Val Ala Asp Leu Gin Ser Zle Asn Zle ser Gly Asn Gly 

260 265 270 

Phe 



(2) INFORMATION FOR SEQ ZD NO: 113: 

(i) SEQUENCE CHARACTERISTICS I 

(A) LENGTH t 421 amino acids 

(B) TTPEs amino acid 
(D) TOPOLOGYt linear 

(ii) MOLECULE TYPE: protein 

(ill) HYPOTHETICAL t NO 

(iv) ANTI -SENSE: NO 

(v) FRAGMENT TYPBt N -terminal 

(Xi) SEQUENCE DESCRIPTION: SEQ ZD NO: 113: 

Gin Leu Tyr ser Ala Leu Ala Asn Lys eye Cys His Val Gly Cys zle 
15 10 15 

Lys Arg Ser Leu Ala Arg Phe cys Met Ser Met Arg Gin Arg lie Thr 
20 25 30 

Leu Lys Asp Tyr Ala Met Arg Phe Gly Gin Thr Lye Thr Ala Lys Asp 
35 40 45 

Leu Gly val Tyr Gin Ser Ala lie Asn Lys Ala Zle His Ala Gly Arg 
50 55 60 

Lys lie Phe Leu Thr lie Asn Ala Asp Gly Ser Val Tyr Ala Glu Glu 
65 70 75 80 

Val Lys Pro Phe Pro Ser Asn Lys Lys Thr Thr Ala ser Asn Lys Lys 
85 90 95 

Thr Thr Ala Met Ala Asp Asp Asp Pro Tyr Gly Thr Gly Gin Met Phe 
100 105 110 
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Hie Leu Aen Thr Ala Leu Thr Bis s r lie Ph Aen Ala Glu Leu Tyr 
115 120 125 



ser Pro Glu lie pro Leu ser Thr Asp Gly pro Tyr Leu Gin He Leu 
130 135 140 

Glu Gin Pro Lya Gin Arg Gly Phe Arg Phe Arg Tyr Val eye Glu Gly 
145 150 155 160 

Pro Ser Bis Gly Gly Leu Pro Gly Ala Ser Ser Glu Lye Aen Lye Lye 
165 170 175 

Ser Tyr Pro Gin Val Lye He Cye Aen Tyr Val Gly Pro Ala Lye Val 
180 185 190 

He val Gin Leu val Thr Aen Gly Lye Aen He Bim Leu Bie Ala Bia 

195 200 205 

Ser Leu Val Gly Lye Bia Cye Glu Asp Gly Val Cye Thr Val Thr Ala 
210 215 220 

Gly Pro Lye Aep Met Val Val Gly Phe Ala Aen Leu Gly He Leu Bie 
225 230 235 240 

val Thr Lye Lye Lye Val Phe Glu Thr Leu Glu Ala Arg Met Thr Glu 
245 250 255 

Ala Cye He Arg Gly Tyr Aen Pro Gly Leu Leu val Bia ser Aep Leu 
260 265 270 

Ala Tyr Leu Gin Ala Glu Gly Gly Gly Aep Arg Gin Leu Thr Aep Arg 
275 280 285 

Glu Lye Glu He He Arg Gin Ala Ala Val Gin Gin Thr Lye Glu Met 

290 295 300 

Asp Leu Ser Val Val Arg Leu Met Phe Thr Ala Phe Leu Pro Aep ser 

305 310 315 320 

Thr Gly ser Phe Thr Arg Arg Leu Glu Pro Val Val ser Aep Ala He 
325 330 335 

Tyr Aep Ser Lye Ala Pro Aan Ala 6er Aen Leu Lye He Val Arg Met 
340 345 350 

Aap Arg Thr Ala Gly cye val Thr Gly Gly Glu Glu He Tyr Leu Leu 
355 360 365 

Cye Aep Lye Val Gin Lye Aep Aep He Gin He Arg Phe Tyr Glu Glu 
370 375 380 

Glu lu Aid ly ly Val Trp lu Gly Phe Gly Aep Phe Ser Pro Thr 
385 390 395 400 

Aep val Bie Arg Gin Phe Ala II Val Phe Lye Thr Pro Lye Tyr Lye 
405 410 415 
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Aap Val Aan He Thr 
420 



(2) INFORMATI ON FOR SEQ ID NO: 114: 
(1) SEQUENCE CHARACTERISTICS: 

(A) LENGTH* 391 amino acids 

(B) TYPE: anino acid 
(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 

(iii) HYPOTHETICAL: NO 
(iv) ANTI -SENSE: NO 

(v) FRAGMENT TYPE: N-terminal 
(xi) SEQUENCE DESCRIPTION : SEQ ID NO: 114: 

Mat Arg Gin Arg I la Thr Lau Lys Aap Tyr Ala Mat Arg Pha Gly Gin 

15 10 15 

Thr Lya Thr Ala Lya Aap Lau Gly Val Tyr Gin Ser Ala lis Aan Lya 

20 25 30 

Ala lie His Ala Gly Arg Lya lie Phe Leu Thr lie Aan Ala Aap Gly 
35 40 45 

Ser Val Tyr Ala Glu Glu val Lys Pro Phe Pro ser Aan Lya Lya Thr 

50 55 60 

Thr Ala Met Ala Glu Aap Aap Pro Tyr Leu Gly Arg Pro Glu Gin Met 
65 70 75 80 

Phe His Leu Aap Pro Ser Leu Thr His Thr lie Phe Aan Pro Glu Val 
85 90 95 

Phe Gin Pro Gin Met Ala Leu Pro Thr Ala Aap Gly Pro Tyr Leu Gin 
100 105 110 

lie Leu Glu Gin Pro Lys Gin Arg Gly Phe Arg Phe Arg Tyr Val Cya 
115 120 125 

Glu Gly Pro Ser His Gly Gly Leu Pro Gly Ala Ser Ser Glu Lya Aan 
130 135 140 

Lys Lys Ser Tyr Pro Gin Val Lys lie Cys Aan Tyr Val Gly Pro Ala 
145 150 155 160 

Lys Val He Val Gin Leu Val Thr Asn Gly Lya Asn He Hie Leu His 
165 170 175 

Ala His s r Leu val Gly Lys Bis cya Glu Aap Gly He cya Thr val 
180 185 190 
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Thr Ala Gly pro Glu Asp eye Val Bis Gly Pho Ala Asn Leu Gly He 
195 200 205 

Leu Hie Val Thr Lye Lys Lye Val Pbe Glu Thr Leu Glu Ala Arg Met 

210 215 220 

Thr Glu Ala Cys lie Arg Gly Tyr Asn Pro Gly Leu Leu Val Bia Pro 
225 230 235 240 

Asp Leu Ala Tyr Leu Gin Ala Glu Gly Gly Gly Asp Arg Gin Leu Gly 
245 250 255 



Aap Arg Glu Lye Glu Leu He Arg Gin Ala Ala Leu Gin Gin Thr Lye 

260 265 270 

Glu Met Asp Leu Ser Val Val Arg Leu Met Phe Thr Ala Phe Leu Pro 
275 260 285 

Asp ser Thr Gly Ser Phe Thr Arg Arg Leu Glu Pro Val Val Ser Asp 
290 295 300 

Ala He Tyr Asp ser Lys Ala Pro Asn Ala ser Asn Leu Lys He Val 

305 310 315 320 

Arg Met Asp Arg Thr Ala Gly eye Val Thr Gly Gly Glu Glu He Tyr 
325 330 335 

Leu Leu Cye Asp Lys Val Gin Lys Asp Asp He Gin He Arg Phe Tyr 
340 345 350 

Glu Glu Glu Glu Aen Gly Gly Val Trp Glu Gly Phe Gly Asp Phe Ser 

355 360 365 

Pro Thr Asp Val Bis Arg Gin Phe Ala He val Phe Lys Thr Pro Lys 
370 375 380 

Tyr Lys Asp He Aen He Thr 

385 390 



(2) INFORMATION FOR 8EQ ID NO: 115: 

(1) SEQUENCE CHARACTERISTICS* 

(A) LENGTH: 391 anino acids 

(B) TYPE* amino acid 
(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE z protein 

(iii) HYPOTHETICAL: HO 

(iv) ANTI -SENSE: NO 

(V) FRAGMENT TYPE i N-terminal 
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(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 115: 

Mat Glu Gin Glu He Thr Leu Lye Asp Tyr Ala Met Arg Phe Gly Gin 

15 10 15 

Thr Lys Thr Ale Lye Asp Leu Gly val Tyr Gin Ser Ala He Asn Lys 
20 25 30 

Ala lie His Ala Gly Arg Lys lie Phe Leu Thr lie Asn Ala Asp Gly 
35 40 45 

ser val Tyr Ala Glu Glu Val Lys Pro Phe Pro Ser Asn Lys Lys Thr 
50 55 60 

Thr Ala Met Ala Glu Asp Asp Pro Tyr Leu Gly Arg Pro Glu Gin Met 

65 70 75 80 

Phe His Leu Asp Pro ser Leu Thr Bis Thr He Phe Asn Pro Glu Val 
85 90 95 

Phe Gin Pro Gin Met Ala Leu Pro Thr Ala Asp Gly Pro Tyr Leu Gin 
100 105 110 

lie Leu Glu Gin Pro Lys Gin Arg Gly Phe Arg Phe Arg Tyr Val Cys 

115 120 125 

Glu Gly Pro Ser His Gly Gly Leu Pro Gly Ala ser Ser Glu Lys Asn 
130 135 140 

Lys Lys ser Tyr Pro Gin Val Lys He cys Asn Tyr val Gly Pro Ala 
145 150 155 160 

Lys Val He Val Gin Leu Val Thr Asn Gly Lys Asn He Bis Leu His 

165 170 175 

Ala Bis ser Leu val Gly Lys Bis cys Glu Asp Gly He cys Thr Val 
180 185 190 

Thr Ala Gly Pro Glu Asp Cys Val Bis Gly Phe Ala Asn Leu Gly He 
195 200 205 

Leu Bis val Thr Lys Lys Lys Val Phe Glu Thr Leu Glu Ala Arg Met 
210 215 220 

Thr Glu Ala cys He Arg Gly Tyr Asn Pro Gly Leu Leu Val Bis Pro 
225 230 235 240 

Asp Leu Ala Tyr Leu Gin Ala Glu Gly Gly Gly Asp Arg Gin Leu Gly 
245 250 255 

Asp Arg Glu Lys lu Leu ll Arg Gin Ala Ala Leu Gin Gin Thr Lys 
260 265 270 

Glu Met Asp L u Ser Val Val Arg Leu M t Phe Thr Ala Phe L u Pr 
275 280 285 
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Asp ser Thr Gly Ser Phe Thr Arg Arg Leu Glu Pro Val Val Ser Aap 
290 295 300 

Ala He Tyr Asp Ser Lye Ala Pro Asn Ala ser Asn Leu Lys He Val 

305 310 315 320 

Arg Met Asp Arg Thr Ala Gly cya Val Thr Gly Gly Glu Glu lie Tyr 

325 330 335 

Leu Leu Cya Aap Lye Val Gin Lya Asp Aap He Gin lie Arg Phe Tyr 
340 345 350 

Glu Glu Glu Glu Aon Gly Gly Val Trp Glu Gly Phe Gly Aap Phe ser 
355 360 365 

Pro Thr Aap val Bia Arg Gin Phe Ala He val Phe Lya Thr Pro Lya 
370 375 380 

Tyr Lya Asp He Asn He Thr 
385 390 



(2) INFORMATION FOR SEQ ID NO: 116 1 

(1) SEQUENCE CHARACTERISTICS* 

(A) LENGTH i 241 amino acids 

(B) TYPE* amino acid 
(D) TOPOLOGY : linear 

(ii) MOLECULE TTPEs protein 

(iii) HYPOTHETICAL : NO 

(iv) ANTI -SENSE: NO 

(V) FRAGMENT TYPE: N-tenninal 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 116 1 

Met Arg Gin Arg He Thr Leu Lys Asp Tyr Ala Met Arg Phe Gly Gin 
15 10 15 

Thr Lys Thr Ala Lys Asp Leu Gly Val Tyr Gin Ser Ala He Asn Lys 
20 25 30 

Ala He His Ala Gly Arg Lys He Phe Leu Thr He Asn Ala Asp Gly 
35 40 45 

ser Val Tyr Ala Glu Glu val Lye pro Phe Pro ser Asn Lys Lys Thr 
50 55 60 

Thr Ala Ser Asn Lys Lys Thr Thr Ala Gly Asp Pro ly Lys Lys Lys 
65 70 75 80 

Gin Bis He Cys His He Gin Gly Cys Gly Lys Val Tyr Gly Lys Thr 
85 90 95 
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Ser His Leu Arg Ala His 

100 



Leu Arg Trp Bis Thr Gly Glu Arg Pro Phe 
105 110 



Met Cys Thr Trp ser Tyr 

115 



Cys Gly Lys Arg Phe Thr Arg Ser Asp Glu 
120 125 



Leu Gin Arg His Lys Arg 
130 



Thr Bis Thr Gly Glu Lys Lys Phe Ala Cys 
135 140 



Pro Glu Cys Pro Lys Arg Phe Met Arg Ser Asp Bis Leu Ser Lys Bis 
145 150 155 160 

He Lys Thr Bis Gin Asn Lys Lys Gly Gly pro Gly val Ala Leu ser 
165 170 175 

val Gly Thr Leu Pro Leu Asp Ser Gly Ala Gly Ser Glu Gly ser Gly 
180 185 190 

Thr Ala Thr Pro Ser Ala Leu lie Thr Thr Asn Met val Ala Met Glu 
195 200 205 

Ala He Cys Pro Glu Gly He Ala Arg Leu Ala Asn ser Gly He Asn 
210 215 220 

Val Met Gin Val Ala Asp Leu Gin Ser lie Asn He Ser Gly Asn Gly 
225 230 235 240 

Phe 



(2) INFORMATION FOR SEQ ID NO: 117: 

(i) SEQUENCE CHARACTERISTICS : 

(A) LENGTH: 10 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESSl both 

(D) TOPOLOGY: linear 
(ii) MOLECULE TYPE: CDNA 

(iii) HYPOTHETICAL: NO 
(iv) ANTI -SENSE: NO 

(Xi) SEQUENCE DESCRIPTION: SEQ ID NO: 117: 
GGGAMTNYCC 10 



(2) INFORMATION FOR SEQ ID NO: 118 1 

(i) SEQUENCE CHARACTERISTICS: 
(A> length: 72 amino acids 
(B) TYPE: amino acid 
(D) TOPOLOGY : linear 
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(ii) MOLECULE TYPE: protein 
(Hi) HYPOTHETICAL t NO 
(iv) ANTI -SENSE : NO 

(v) FRAGMENT TYPE: N-terminal 
(Xi) SEQUENCE DESCRIPTION: SEQ ID NO: 11 8: 

Met Glu Pro val Asp Pro Arg Leu Glu Pro Trp Lys His Pro Gly Ser 
15 10 15 

Gin Pro Lys Thr Ala Cys Thr Asn Cyi Tyr cys Lys Lys Cys Cys Phe 
20 25 30 

Bis Cys Gin Val Cys Phe lie Thr Lys Ala Leu Gly He Ser Tyr Gly 
35 40 45 

Arg Lys Lys Arg Arg Gin Arg Arg Arg Ala His Gin Asn ser Gin Thr 
50 55 60 

His Gin Ala ser Leu ser Lys Gin 

65 70 



WO 96/17956 



FCT/US95/15944 



130 

What is claimed is: 

1 1. A probe nucleic acid (PN A) comprising: 

2 (a) a single-stranded sequence, 1/2 TBR, which is capable of forming, under 

3 hybridizing conditions, a hybrid, TBR, with a 1/2 TBR present in a target nucleic 

4 acid(TNA); 

5 (b) a single stranded sequence, 1/2 BBR, which is capable of fanning, under 

6 hybridizing conditions, a hybrid BBR, with about 0-10 1/2 BBR present in a 

7 booster nucleic acid (BNA); and 

8 (c) an OSA, which is no attached support and/or indicator, or an attached support or 

9 other means of localization, including, but not limited to, attachment to beads, 

10 polymers, and surfaces, and/or indicators; 

1 1 wherein said TBR is capable of binding with high affinity to a TBA, said TBA being a substance 

12 capable of discriminating between a paired TBR and a TBR having unpaired nucleotides, and 

13 further, wherein said BBR is capable of binding with high affinity to a BBA, said BBA being a 

14 substance capable of discriminating between a paired BBR and a BBR having unpaired nucleotides. 

1 2. A booster nucleic acid (BNA) comprising: 

2 (a) 8 1/2 BBR which has a sequence which is cooqilenientary to a 1/2 BBR sequence 

3 in a PNA or another BNA and which is capable of forming, under hybridizing 

4 conditions, a hybrid, BBR, with the PNA; 

5 (b) an OSA, which is no attached support or indicator or an attached support or other 

6 means of localization, including, but not limited to, attachment to beads, polymers, 

7 and surfaces, and/or indicators; and 

8 (c) additional hybridization sites, 1/2 BBRs, for hybridization with additional BNAs; 

9 wherein said BBR is capable of binding with high affinity to a BBA, said BBA being a substance 
10 capable of discriminating between a paired BBR and a BBR having unpaired nucleotides. 

1 3. A Hairpin Nucleic Acid (HNA) comprising a single-stranded sequence, 1/2 BBR, which 

2 under hybridizing conditions is capable of forming a hairpin while at the same time binding to a 

3 BNA to form a BBR capable of binding a BBA, wherein said BBR is capable of binding with high 

4 affinity to a BBA, said BBA being a substance capable of discriminating between a perfect BBR and 

5 a BBR having unpaired nucleotides. 
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1 4. The PNA of claim 1 wherein the TBR is comprised ofone or more recognition sites for 

2 a nucleic acid binding protein, a DN A binding protein, a DNA-RNA hybrid binding protein or an 

3 RNA binding protein. 

1 5. The PNA of claim 4 wherem the TBR is a nucleic acti 

2 present in the genome of a pathogen or is a binding site associated with a pathogenic condition in 

3 a vertebrate genome or is a nucleic acid binding protein recognition site present in the genome of an 

4 organism which contaminates a fermentation process. 

1 6. The PNA of claim 4 wherein the TBR is the HTV-LTR or a portion thereof. 

1 7. A method for detecting or localizing a specific TNA sequence, comprising the steps of: 

2 (a) hybridizing said TNA with the PNA of claim 1; 

3 (b) hybridizing said PNA with a BNA containing a 1/2 BBR whose sequence is 

4 complementary to a 1/2 BBR sequence in the PNA; 

5 (c) adding the products of steps (a) and (b) containing a TBR and a BBR, to a surface, 

6 liquid or other medium containing a TBA; 

7 (d) adding BBAs to the mixture in step (c) wherein said BBA comprises: 

8 (i) a molecule or a portion of a molecule which is capable of selectively 

9 binding to a BBR; 

10 (ii) a detectible indicator, and 

11 (e) detecting signal produced by the indicator attached to the BBA. 

1 8. The method of claim 7 wherein said indicator is a protein, including enzymes capable f 

2 catalyzing reactions leading to production of colored reaction products; a radionuclide; colored 

3 beads. 

1 9. A method for detecting the pteseaoe in a sample of a specific Target Nucleic Acid, TNA, 

2 which comprises: 

3 (a) contacting said sample with a Probe Nucleic Add, PNA, which, upon hybridization 

4 with said TNA if present in said sample, forms a Target Binding Region, TBR, 

5 which is capable f binding a Target Binding Assembly, TBA; and 
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6 (b) contacting said sample, already in contact with said PNA, with a TBA capable f 

7 binding to any TBRs fbnned by the hybridization of said PNA and said TNA in the 

8 sample. 

1 10. A method for detecting or localizing specific nucleic acid sequences with a high degree 

2 of sensitivity and specificity which comprises: 

3 (a) adding PNAs containing a 1/2 BBR and a 1/2 TBR to a sample containing or 

4 suspected of containing TNAs containing 1/2 TBR sequences, to farm a complex 

5 having target binding regions, TBRs, formed by the hybridization of 

6 complementary 1/2 TBRs present in the PNAs and TNAs respectively; 

7 (b) binding the TBRs formed in step (a) to an immobilized TBA to form a TB A-TN A- 

8 PNA complex; 

9 (c) adding Booster Nucleic Acids, BNAs, containing booster binding regions, 1/2 

10 BBRs, to the complex formed in step (b) such that the 1/2 BBRs in the BNAs 

11 hybridize with the 1/2 BBR sequences present in the PNAs or to 1/2 BBRs present 

12 in BNAs already bound to the PNA, to form BBRs, such that TBA-TNA-PNA- 

13 (BNAX, complexes are formed; 

14 (d) adding Hairpin Nucleic Acids, HNAs, containing 1/2 BBR sequences, to the 

15 complex formed in step (c) such that the 1/2 BBRs in the HNAs hybridize with any 

16 available 1/2 BBR sequences present in the BNAs of the complex of step (c), 

17 thereby capping the extension of the BNAs onto the TBA-TNA-PNA-(BNA) n 

1 8 complexes of step (c) to form TBA-TNA-PNA-(BNA) n -HNA complexes; 

19 (e) adding Booster Binding Assemblies, BBAs, linked to indicator moieties, to the 

20 TBA-TNA-PNA-(BNAVHNA complexes formed in step (d) to form TBA-TNA- 

21 PNA-(BNA-BBA) n -HNA complexes; and 

22 (f) detecting the signals produced by the indicator moieties linked to the TB As, PNAs, 

23 BNAs, BBAs or HNAs in the TBA-TNA-PNA-(BNA-BBA) 0 -HNA complexes of 

24 step(e); 

25 wherein the TNA comprises: 

26 (i) one or more specific 1/2 TBR nucleic acid sequences, the presence or 

27 absence of which in a particular sample is to be confirmed; 

28 the PNA comprises: 
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29 (i) a single-stranded sequence, 1/2 TBR, which is capable of forming, under 

30 hybridizing conditions, a hybrid, TBR, with a 1/2 TBR present in a target 

31 nucleic acid (TNA); 

32 (ii) a single stranded sequence, 1/2 BBR, which is capable of forming, under 

33 hybridizing oonftitww; a hybrid BBR with a 1/2 BBR present in a booster 

34 nudetc acid (BNA); and 

35 (iii) an OSA, which is no attached support and/or indicator, or an attached 

36 support or other means of localization, including, but not limited to, 

37 attachment to beads, polymers, and surfaces, and/or indicators; 

38 the BNA comprises: 

39 (i) a 1/2 BBR, as shown in Figure l(IIb), which has a sequence which is 

40 complementary to a 1/2 BBR sequence in a PNA and which is capable f 

41 forming, under hybridizing conditions, a hybrid, BBR, with the PNA; 

42 (ii) an OSA, which is no attached support or indictor or an attached support 

43 or other means of localintfton, including, but not limited to, attachment to 

44 beads, polymers, and surfaces, and/or indicators; 

45 (iii) additional hybridization sites, 1/2 BBRs, for other BN As; and 

46 (iv) sequences, 1/2 BBRs, which can hybridize to BN As already hybridized to 

47 the PNA; 

48 the BBA comprises: 

49 (i) a molecule or a portion of a molecule which is capable of selectively 

50 binding to a BBR; and 

51 (ii) an OSA, which is no attached support and/or indicator, or an attached 

52 support or other means of localization, including, but not limited to, 

53 attachment to beads, polymers, and surfaces, and/or indicators; 

54 and the TBA comprises: 

55 (i) a molecule or a portion of a molecule which is capable of selectively 

56 binding to a TBR; and 

57 (ii) no attached support and/or indicator, or an attached support or other 

58 means of localization, including, but not limited to, attachment to beads, 

59 polymers, and surfaces, and/or indicators. 



1 11. In a solid phase hybridization method for detecting the presence of a target 

2 polynucleotide involving: immobilizing a target polynucleotide, if present in a test sample, directly 
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3 or via an intermediate capture structure, on a solid phase at a capture site; before, during or after said 

4 immobilization, attaching a detectable label to said target polynucleotide, if present; and detecting 

5 said label if any, at said capture site; the improvement comprising: 

6 (a) using a Target Binding Assembly, TBA, as the means for achieving immobilization 

7 of said target polynucleotide, wherein said TBA binds only to a unique hybrid 

8 formed be tw een a specific Probe Nucleic Acid, PNA, and said target nucleic acid 

9 such that a perfect Target Binding Region, TBR, recognizable by said TBA is 

10 formed; and 

11 (b) including in the PNA a single stranded sequence, 1/2 BBR, capable of binding a 

12 Booster Nucleic Acid, BNA, containing a single stranded complementary 1/2 BBR 

13 which, upon hybridization with the 1/2 BBR in the PNA, forms a BBR capable of 

14 binding labeled Booster Binding Assemblies, BBAs. 

1 12. A target binding assembly, TBA, or a booster binding assembly, BBA, comprising at 

2 least one nucleic acid recognition unit, and optionally erne or all of the sequences selected from die 

3 group consisting of a linker sequence, an assembly sequence, an asymmetry sequence, a nuclear 

4 localization signal sequence (NLS) and an OS A. 

1 13. The TBA of claim 12 wherein the nucleic acid recognition unit is selected from the 

2 group consisting of an NF-kB binding unit, an SP1 binding unit, a TATA binding unit, a human 

3 papillomavirus E2 binding unit, an HPV LTR binding unit, an HIV LTR binding unit, and Tat. 

1 14. The TBA of claim 13 wherein the nucleic acid recognition unit has the sequence 

2 selected from the group consisting of SEQ ID NO. 63, SEQ ID NO. 64, SEQ ID NO. 65, SEQ ID 

3 NO. 66, SEQ ID NO. 67, SEQ ID NO. 68, SEQ ID NO. 69, SEQ ID NO. 70, SEQ ID NO. 7 1, SEQ 

4 ID NO. 72, SEQ ID NO. 73, SEQ ID NO. 74, SEQ ID NO. 75, SEQ ID NO. 76, SEQ ED NO. 77, 

5 SEQ ID NO. 78, SEQ ID NO. 79, SEQ ID NO. 80, SEQ ID NO. 8 1, SEQ ID NO. 82, SEQ ID NO. 

6 83, SEQ ID NO. 84, SEQ ID NO. 93, SEQ ID NO. 94, SEQ ID NO. 95, SEQ ID NO. 96, SEQ ID 

7 NO. 97, SEQ ID NO. 98, and SEQ ID NO. 118. 

1 15. The TBA of claim 12 wherein the linker sequence is an oligopeptide which does not 

2 interfere with the nucleic acid recognition function of the nucleic acid recognition unit and which 

3 provides stability and control over the spacing of the nucleic acid recognition unit from the remainder 

4 of the TBA. 
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1 16. Tbc TBA of claim IS wherein the linker sequence is an oligopeptide sequence from the 

2 intentomain primary sequence of a structural protein. 

1 17. The TBA of claim 12 wherein the assembly sequence is an oligopeptide sequence which 

2 directs the folding and association of nucleic acid recognition units. 

1 18. The TBA of claim 17 wherein the assembly sequence is derived from the bacteriophage 

2 lambda cro protein or the CI protein and is selected from the group consisting of SEQ ID NO. 104, 

3 SEQ ID NO. 105, SEQ ID NO. 106, SEQ ID NO. 107, and SEQ ID NO. 108. 

1 19. The TBA of claim 12wherantheas>imnetry sequence directs the a 

2 acid recognition and assembly sequences in a predetermined order. 

1 20. The TBA of claim 19 wherein the asymmetry sequence is derived from insulin, 

2 gonadotropic hormone, FSH, HCG, LH, ACTH, or retain. 

1 21. The TBA of claim 20 wherein the asymmetry sequence is selected from the group 

2 consisting of SEQ ID NO. 85, SEQ ID NO. 86, SEQ ID NO. 87, SEQ ID NO. 88, SEQ ID NO. 89, 

3 SEQ ID NO. 90, SEQ ID NO. 91, and SEQ ID NO. 92. 

1 22. The TBA of claim 12 wherein the NLS is an oligopeptide which directs the migration 

2 and uptake of a protein or complex associated with said NLS into the nucleus of a cell. 

1 23. The TBA of claim 22 wherein the NLS is selected from the group consisting of SEQ 

2 ID NO. 72 and SEQ ID NO. 103. 

1 24. The TBA of claim 12 which is HIV Detect MV or HPV Detect I-IV. 

1 25. The TBA of claim 12 having a sequence selected from the group consisting of SEQ ID 

2 NO. 1 09, SEQ ID NO. 1 10, SEQ ID NO. 1 1 1, SEQ ID NO. 1 12, SEQ ID NO. 1 13, SEQ ID NO. 

3 114, SEQ ID NO. 115, and SEQ ID NO. 116. 



W 96/17956 



PCT/US95/15944 



136 

1 26. A method of using the TBA of claim 12 to bind a particular nucleic acid sequence in 

2 a target nucleic acid sample which comprises: 

3 (a) fragmenting the nucleic acid in the target nucleic acid sample; 

4 (b) contacting, under hybridizing conditions, the fragmented nucleic acid with a probe 

5 nucleic acid complementary to the particular nucleic acid sequence of interest, 

6 wherein said probe nucleic acid, upon hybridization with said particular nucleic 

7 acid sequence of interest forms a target binding region to which said TBA 
g specifically binds. 

1 27. The method of claim 26 wherein said probe nucleic acid, in addition to sequences 

2 complementary to said particular nucleic acid sequence of interest, also has ad ditional sequences to 

3 which a booster nucleic acid can bind to form a booster binding site to which a labeled booster 

4 binding assembly can bind to provide a signal showing and amplifying the binding of the probe 

5 nucleic acid to the target nucleic acid sequence of interest 

1 28. A method of using the TBA of claim 12 wherein said TBA is administered to patient 

2 in need of such treatment a therapeutically or prophylactkally effective amount of said TBA, which 

3 comprises administering the TBA, either in the form of a purified protein complex or in the form of 

4 a recombinant vector which, upon enliy into the patient is able to express the TBA, such that the 

5 TBA binds the particular nucleic acid sequence to achieve the desired prophylactic or therapeutic 

6 result 

1 29. The method of claim 28 wherein said TBA is selected from the group consisting of SEQ 

2 ID NO. 109, SEQ ID NO. 1 10, SEQ ID NO. 1 1 1, SEQ ID NO. 1 12, SEQ ID NO. 1 13, SEQ ID NO. 

3 114, SEQ ID NO. 115, and SEQ ID NO. 1 16, and the patient is infected with HIV or HPV. 

1 30. The method of claim 26 further comprising the step of: 

2 (c) monitoring the shift in mobility of nucleic acids in the target nucleic acid sample 

3 as a function of the size such that binding of the TBA to a particular fragment in 

4 the sample modifies the mobility of the fragment. 

1 3 1 . A diagnostic or forensic test kit for the detection in a sample of nucleic acid having a 

2 specific sequence composition, which comprises: 
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3 (a) a first nucleic acid probe complementary to nucleic acid with specific sequence 

4 composition, the presence of which is to be ascertained in a test sample, wherein said first nucleic 

5 acid probe and said nucleic acid with specific sequence composition forming, upon hybridization, 

6 a binding she for a first niriric acid binding protein, and wherein said first nucleic acid probe further 

7 comprises additional sequence complementary to a second nucleic acid probe; 

8 (b) a first nucleic acid binding protein specific for the duplex formed by hybridization of 

9 said first nucleic acid probe and said nucleic acid with specific sequence composition; 

10 (c) a second nucleic acid probe complementary to said additional sequence in said first 

11 nucleic acid probe, wherein, upon hybridization of said first and second nucleic acid probes, a 

12 binding site for a second nucleic acid binding protein is formed; 

13 (d) a second nucleic acid binding protein which binds specifically to the duplex formed 

14 upon hybridization of said first nucleic acid probe and said second nucleic acid probe, wherein said 

15 second nucleic acid binding protein is labeled with a detectable label. 

1 32. The diagnostic or forensic test kit of claim 3 1 wherein said first nucleic acid probe is 

2 complementary to the HIV LTR, such that upon hybridization of said first nucleic acid probe with 

3 an HIV LTR, a binding site is formed for NF-kB or a subunit thereof S P 1 , TATA binding protein, 

4 HIV-Detcct I, H m, or IV, or HTV-Lock. 

1 33. The diagnostic or forensic test kit of claim 32 wherein said first nucleic acid binding 

2 protein is NF-kB or a subunit thereof, SP 1 , TATA binding protein, HIV-Detect I, n, HI, or IV, or 

3 HIV-Lock. 

1 34. The diagnostic or forensic test kit of claim 33 wherein said first nucleic acid probe, in 

2 addition to being ccxnplementaty to the HIV LTR, comprises a sequence encoding the bacteriophage 

3 lambda left or right operator and said second nucleic acid probe comprises sequences complementary 

4 to said bacteriophage lambda left or right operator sequences in said first nucleic acid probe, such 

5 that upon hybridization of said first and second nucleic acid probes, a binding site for the 

6 bacteriophage lambda CI repressor protein, the bacteriophage lambda cro protein or a derivative or 

7 homology thereoC is formed. 

1 35. The diagnostic or forensic test kit of claim 34 wherein said second nucleic acid binding 

2 protein is the bacteriophage lambda CI repressor protein, the bacteriophage lambda cro protein r 

3 a derivative or homology thereof. 
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1 36. A composition comprising HIV-Lock or a recombinant vector encoding HTV-Lock and 

2 a phannaceutically acceptable carrier. 

1 37. A method of differentially binding a nucleic acid binding protein to a nucleic acid 

2 sequence correlated with a pathogenic condition which comprises: 

3 (a) selecting a particular configuration of nucleic acid binding protein sequences 

4 present in the nucleic acid sequence correlated with a pathogenic condition as a 

5 target sequence for designing a probe nucleic acid which will hybridize to that 

6 particular configuration of nucleic acid sequences if present in a test sample, and 

7 further, ensuring that a binding site for an available nucleic acid binding protein is 

8 formed upon hybridization of said probe nucleic acid and said particular 

9 configuration of nucleic acid sequences chosen as a target; 

10 (b) selecting a nucleic acid binding protein which specifically binds to the selected 

1 1 particular configuration of nucleic acid binding protein sequences correlated with 

12 a pathogenic condition, but which does not bind to sequences not correlated with 

13 said pathogenic condition; 

14 ( c ) hybridizing said probe nucleic acid with a test sample suspected of containing said 

15 particular configuration of nucleic acid binding protein sequences present in nucleic 

16 acid sequences correlated with a pathogenic condition; 

17 (d) contacting said nucleic acid binding protein with any hybrids formed in step (b); 

18 and 

1 9 ( c ) detecting any binding of said nucleic acid binding protein with said hybrids. 

1 38. The method of claim 37 wherein said particular configuration of nucleic acid binding 

2 protein sequences is chosen from a necessary step or control point in the development of a 

3 pathogenic condition. 

1 39. The method of claim 9 wherein said method is carried out in an automated fashion. 

1 40. The method of claim 39 wherein the method is carried out in the Abbott Laboratories 

2 IMx machine. 



1 



41. The method of claim 9 carried out in a microtiter plate. 
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1 42. A method of amplifying the signal obtained through binding the PNA of claim 1 to a 

2 TNA which comprises binding BNAs to the PNA-TNA hybrid and binding labeled BBAs to the 

3 BNAs. 

1 43. A method of assembling a nucleic acid binding complex which comprises using 

2 asymmetry sequences to direct the association or non-association of components of the nucleic acid 

3 binding complex. 

1 44. A method of assembling a nucleic acid binding complex which comprises using 

2 assembly sequences derived from bacteriophage lambda era or CI to assemble associated 

3 components of the nucleic acid binding complex. 

1 45. A method of using assembly, asymmetry, or piloting sequences to assemble a 

2 multimeric protein complex which comprises linking subunits to be incorporated into the multimeric 

3 protein complex to said assembly, asymmetry, or piloting sequence, and recovering said multimeric 

4 complex. 

1 46. A composition comprising SEQ ID NO. 105, SEQ ID NO. 106, or SEQ ID NO. 108. 

1 47. A nucleic acid encoding the TBA or BBA of claim 12. 

1 48. The TBA of claim 12 or a nucleic acid encoding said TBA, wherein the amino acid 

2 sequence of said TBA is selected from the group consisting of set A, set B and set C, wherein said 

3 sets are comprised as follows: 

4 Set Link Sequences from Groups 

5 a i+n+m 

6 B iv + v + m 

7 c iv+m 

8 wherein groups I»V consist of sequences selected from: 

9 Group Selected from Sequences 

10 I Any of SEQ ID NOS. 85-92 

1 1 II Met Ser, linked to any of SEQ ID NOS. 104-1 06, each of which is linked to SEQ 

12 ID NO. 99 
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13 ffl SEQ ID NO. 100 linked to any of SEQ ID NOS. 75-84 or 94-98; SEQ ID NO. 101 

14 lmkaitoc^SEQIDNO. 74ofSEQIDNO. 93;OTSEQroNO. 102 linked to 

15 SEQ ID NO. 74 or SEQ ID NO. 93; or any of SEQ ID NOS. 72, 103, 73, or 63-7 1 

16 IV Any of SEQ ID NOS. 104-108 

17 V SEQ ID NO. 99. 

1 49. A method of assembling multimeric TBAs in vivo or in situ which comprises 

2 introducing component TBAs into a cell utilizing a covalcntly or non-covaiently attached protein or 

3 bi-layer vesicle or by introducing nucleic acids encoding component TBAs into a cell, said 

4 component TBAs each comprising a DNA recognition unit, assembly sequences, asymmetry 

5 sequences, nuclear localization signal sequences, and optional linker sequences, such that upon 

6 proximal binding via the DNA recognition unit of each component TBA to nucleic acid sequences 

7 encountered in the nucleus or elsewhere in the cell, component expressed TBAs assemble via said 

8 assembly and asymmetry sequences into multimehc TBAs. 

1 50. A method for identifying nucleic acid binding molecules for preparation of a target 

2 binding assembly or a booster binding assembly comprising: 

3 a. Obtaining a sample containing the target nucleic acid; 

4 b. Fragmenting the sample so as to expose the nucleic acids and to reduce the size 

5 complexity of the nucleic acids contained in the sample; 

6 c. Contacting a first aliquot of the fragmented nucleic acids with a control buffer 

7 medium and contacting a second aliquot of the fragmented nucleic acids with the 

8 control buffer medium containing a known profile of nucleic acid binding 

9 molecules; 

10 d. Analyzing the two aliquots to identify fragments which have altered behavior in the 

1 1 aliquot contacted with the target binding molecules as opposed to the aliquot 

12 contacted with the control buffer medium; 

13 e. Identifying and isolating fragments which do exhibit altered behavior when 

1 4 contacted with the nucleic acid binding molecule and either sequencing the nucleic 

15 acid fragment to determine whether known nucleic acid binding molecule motifs are 

16 present, or directly identifying the nucleic acid binding molecule bound to the 

17 nucleic acid; and 
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1 8 f. synthesizing TBAs comprising the nucleic acid binding molecules which produced 

19 the altered behavior using assembly, asymmetry nuclear localization and, 

20 optionally, linker sequences. 

1 51. A method for identifying specific nucleic acid sequences in a sample 

2 comprising: 

3 a. Fragmenting the nucleic acids in said sample to expose the nucleic acids and reduce 

4 the size complexity of the nucleic adds; 

5 b. Contacting a TBA with the sample, said TBA comprising two or more nucleic acid 

6 binding components each of which has a relatively weak binding for its nucleic acid 

7 recognition unit within the TBR but which in combination provides strong binding 

8 for the complete TBR; and 

9 c. Eliminating any "cross-talk" produced by binding of the TBA to cousin nucleic 

10 acids that contain individual recognition units, which comprises contacting the 

1 1 sample with excess nucleic acid binding components with relatively strong binding 

12 affinity for cousin nucleic acids that contain the individual recognition units but 

13 relatively weak binding relative to the TBA's affinity for binding to the complete 

14 TBR having said two or more nucleic acid binding components. 



WO 96/17956 



1/29 



PCT/US95/15944 




ctiDv i ni i i L qrocT /QIH C MX 



WO 96/17956 



2/29 



PCT/US95/15944 




WO 96/17956 



3/29 



PCTAJS9S/1S944 




cuDCTmirrQDm 1 taw 1 n\ 



WO 96/17954 



PCT/US95/15944 




WO 96/17956 




PCT/US9SH5944 

5/29 




cnficrninrttirrr /mm r *t\ 



W 96/17956 

6/29 



PCTAJS9S/15944 




SUKmUTFSBEn (RULE 2tt 



WO 96/17956 



PC17US95/1S944 



7/29 




WO 96/17956 



9/29 



PC17US9S/15944 



BBR-BBA 
COMPLEX 




(VII) 





FIGURE 4B 



aftsnnmrcumr /bin f n\ 



W 96/17956 



10/29 



FCT/US95/15944 



(VA) 



(VIIA) 



(IXA) 



V 2 TBR V 2 BBR 
EH rfnuihiiiirtmi ' i frEI 



(HA) 



(IIIA) Eh 



TNA 



CNA 

TTTTJTJTT" 



TBR V,BBR 



MM 



TBR-TBA 
COMPLEX 





TBA 



+ 

BNA 
ifTmfri E l 



V 2 TBR V 2 BBR 

El ill mifnuu li ni rfrEI 
nfifini 

CNA 



TBR-TBA 
COMPLEX 




V 2 TBR 



BBR 

GD iT'iiniftiuif inrriHS) 
iiTifnn yi niig i.joi 

CNA 



BBA 



TBR-TBA 
.COMPLEX 



(I) 

(hb) 



V 2 TBR V 2 BBR 
1 1 1 ul 1 1 1 1 1 f r Ei + (g l ili i ini'iinu1nmi , i ED (MB) 

CNA 



(IV) 



(VB) 



(VI) 



(VIIB) 



(VIII) 



BBR-BBA 
COMPLEX 




V 2 TBR 



BBR-BBA 
COMPLEX 



ED in inn 
CNA 




(IXB) 



FIGURE 4C 



W 96/17956 



11/29 



PCT/US95/ 15944 



(IA) la} l i ning 

^TBR-i V 2 BBR 

(IB) to) ,<7TT7m > > 1 1 1 1 1 1 1 1 1 1 Ml 



^TBR-j V 2 TBR 2 
(IIA) EJ- 

V 2 TBR-| V 2 TBR 2 V 2 BBR 
(HB) E3 1 1 n iP i 1 1 1 1 il 1 1 1 m*! 1 1 n i *i n i n P ii 

l^TBR<| V 2 TBR 2 V 2 TBR 3 
(HIA) ED ri i ni i Piinii'iiMi'Tp i iMiPi ii i ii P t i 

\tbi^ V 2 tbr 2 V 2 tbr 3 V 2 bbr 
(hib) ® 1 1 1 1 1 i * i 1 1 1 1 i *i 1 1 1 1 1 f *i 1 1 1 1 i ^i 1 1 1 1 1 n 1 1 1 1 i * i 1 1 1 1 r -it 



^tbRt V 2 tbr 2 V 2 tbr 3 V 2 tbr 4 

(IVA) E3 1 1 1 ii Pi 1 1 1 1 i *i 1 1 1 1 1 p i 1 1 1 1 i *T i 1 1 1 1 Pi 1 1 1 1 i * i i n n Pi 

V 2 tbr-, V 2 tbr 2 V 2 tbr 3 V 2 tbr 4 V 2 bbr 

(IVB) IpJ KTTTTTi N i n i i * r77T77>i 1 1 1 1 1 *] 1 1 1 1 n* i ui i rtniiiP ii ii i Pii i niP i 

V 2 TBR 1 V 2 TBR 2 V 2 TBR 3 V 2 TBR 4 V 2 TBR 5 
( va ) ® i *i 1 1 1 1 1 Pi 1 1 1 1 i *i 1 1 1 1 1 Pi 1 1 1 1 i*i n n i Pi 1 1 1 1 i * i 1 1 1 1 1 Pi 1 1 1 1 il 1 1 1 nrPr 

V^TBR, V 2 TBR 2 V 2 TBR 3 V 2 TBR 4 V 2 TBR 5 V 2 BBR 
(VB) IpJ i*i i i i i i Pi i i i i i *i i i i i i P i i i i i i f n i i i i P i i i i i i f u i i i i P i i i i i i^iTi 1 1 1 Pn 1 1 1 i r i 1 1 1 1 1 Pi 



FIGURE 5 



W 96/17956 



12/29 



PCT/US95/15944 



PNA 

V 2 TBR 1 V 2 TBR 2 V 2 BBR 



m 1 1 1 1 ri 1 1 1 1 n 1 1 1 1 1 f 1 1 1 1 1 n 1 1 1 1 1 r i i 



(I) 



TNA 

l jniHy i"ii |Miny _ 

^TBR r V 2 TBR 2 . 




TBR r TBA II 



XOMPLI 




TBR- 



TTTTII. TTTTf TTTIfTl 



V 2 BBR 



1 1 1 1 1 1 1 1 1 1 1 ri i 




TBR 



TBR 2 -TBA 
JCOMPLEX. 



1 



[2h 



VoBBR 



<■■■■■■! ■ IllllMk 
llllllll ■ ■■■■ TTfffl 



1 1 1 1 1 in ii 1 1 1 ri i 




(id 



(in) 



(IV) 



(V) 



(VI) 



TBR r TBA TBR 2 -TBA 
COMPLEX ^ COMPL! 




V 2 BBR 



(Ulilll Hill Ullillt 
rfffTTn. TWfff TFTTTT, 



1 1 1 1 1 n 1 1 1 1 1 f 1 1 




(VII) 



FIGURE 6A 



WO 96/17956 



13/29 



PCT/US9S/1S944 



PNA 

V 2 TBR-, V 2 TBR 2 V 2 BBR 




DOUBLE 
TBR r TBA 2 




FIGURE 6B 



W 96/17956 



14/29 



PCTA1S9S/15944 



V 2 TBR 1 V 2 TBR 2 V 2 TBR : 



PNA 



V 2 TBR 4 



V,TBR 5 V 2 BBR 



hi 1 1 1 rt i 1 1 \ifi 1 1 1 1 n ii if n 1 1 1 1 1 1 nfi i m ft 1 1 1 1 1 ft i dSS* il 1 1 1 1 1 fr (I) 



TNA 



■t jllllll > imil Jlllll^ llllll jllMi| i"" Miiiii^ rititi jtiiii^ 

V 2 TBR V V 2 TBR 2 . V 2 TBR 3 . V 2 TBR 4< V 2 TBR 5 . 




TBR r TBA || 

TBR 2 TBR 3 TBR 4 TBR 5 V 2 BBR 
El t i i it n »w \ 1 1 1 f i ii 1 1 ft 1 1 1 1 f 1 1 1 1 1 ft 1 1 1 1 f 1 1 1 1 1 ft i » 1 1 fl 1 1 1 1 1 ft i fii ff 1 1 1 1 t f t 




TBRi 



TBR 2 -TBA 
JCOMPLEX. 



t ?iniiftim 




TBR 3 TBR 4 TBR 5 V 2 BBR 



ui i f iiniT fciiii iffTTrrhii ii^ TMiiift i ^ ^i iT iiii ii 



(ID 



(HI) 



(IV) 



(V) 



(VI) 



TBR-j -TBA TBR 2 -TBA 
XOMPLEX^COMPLf 




TBRr 



TBR, 



TBRc V,BBR 



tlllllil AA1IA LUllli'l illli ttitUIl UUI lilillii IUI1 UI1IIIJ 
llllll' fffffftff rflffffV Tffflff ■■■■■ muni Hill VIIIII VI 



\ \ fiS\ rtn ii if i 




(VII) 



FIGURE 6C 



tnsrnirTrQum- raid r n\ 



WO 96/17956 



15/29 



PCT/US95/15944 



TBR.,-TBA 
.COMPLEX. 




TBR« 



TBA 




TBR-TBA 
COMPLEX 



TBR-TBA 
COMPLEX 



7 V 



1 \ 



'IIIIIH Illlllllllllllllll^lllimtilffV iii»»m» ^»»in 
mil VTTTTT1 VfTTl IfflfTfll ■ ■■■■ nfffffftffl ffffffl rfffffftfffff 



TBR-TBA 
COMPLEX V- 



2 BBR 
nrm 





/<§> 





<3> 



(VIII) 



(DC) 



TBR->-TBA 

COMPLEX TBR-TBA TBR-TBA TBR-TBA 
TBR, >v J>/ COMPLEX COMPLEX COMPLEX V 9 BBR 

IXj It 1 1 i J i if 1 1 1 1 n i i 1 1 ivi 1 1 1 W i M r ^ f vt i 1 1 m f » r e r i r» i i i kI L I i r L f II 1 1> J 1 1 1 1 1 1 M II 



1I111 ] imr|< dltmt iULU 1UUW U1U U 1142 CI 

muni ■■■■■ XHITV urn wfTfTf JTTTTi ifTfifT! TTTIi RfTfTf] 






(X) 



TBR,-TBATBR->-TBA 

COMPLEX ^COMPLEX.TBR-TBA TBR-TBA TBR-TBA 

COMPLEX COMPLEX COMPLEX 




Willi' I uiii wiiir t uiv utniu uur mini! jw^ untie 

viiiii* 1 villi vllllll 1 llllfc fJfTfTf? Tim FfTfTfTl IWITI rTflfTfl 



V 2 BBR 
n 65i rt m 1 1 1 f i 






(XI) 



FIGURE 6D 



WO 96/17956 



16/29 



PCI7US95/I5944 



PNA 



V 2 TBRj V 2 TBR 2 V 2 TBR 3 V 2 TBR 4 V 2 TBR 5 V-,BBR 
I2 ] i i 1 1 1 1 1 i*i 1 1 1 1 n 1 1 1 1 1 1 1 1 i i i n 1 1 ri 1 1 1 1 n 1 1 1 1 1 fi 1 1 1 1 ifiTm^rT^SrrhTTTr^r (I) 



TNA 

i j iiii [g iim j 11 iig i ii ii gi m i^ i i i 1 i ji ii iu i i". 
V 2 TBR r V 2 TBR 2 . V 2 TBR 3 . V 2 TBR 4 . V 2 TBR 5 . 

TBR 7 TBR 2 TBR3 TBR 4 TBR5 V 2 BBR 

0 3 f n i iiftunrf 
ii 1 1 1 1 ; ■! ■ ■ ■ 1 1 ■ 




1 1 1 1 1 ft 11 11 f 1 1 1 1 1 ft 1 1 1 1 ti 1 1 ) 1 1 ft m 1 1 1 1 1 1 1 f t 1 fli 11 1 1 1 1 1 > i /TTT . 
1 1 1 1 1 J 1 n 1 1 1 m J 1 1 1 1 y t 1 " "f " " 'IHlJ . ,,,,,,Tr (HI) 



DOUBLE 



DOUBLE 
TBR-|-TBA 2 





It 

TBR3 TBR< 



TBRc VoBBR 



ainii i ihh 1 HHK 1 inn 1 1 1 11 1 11 urn 111 1 11 11 iim ii m i ii 



5 e 'V 

I lijl ll 1 1 1 1 ifl 



(IV) 




(V) 



TBA 




(VI) 



DOUBLE 
TBR r TBA 2 
.COMPLEX 



TBR-TBA TBR-TBA TBR-TBA 
COMPLEX COMPLEX COMPLEX V 2 BBR 




FIGURE 6E 



WO 96/17956 



17/29 



PCTAJS95/15944 



00 

m 



uj 

00 



t 



0) 



* 
* 

< 

< 

t 

< 



< 

o 

3 

# 



• * 



o 

E 

o 



CsJ 



0) 



< 

»- 





• • 














o 


§ 






-M 


S- 






C 


M- 






0) 








E 








•p-» 


CU 






r-» 


E 








1. 






E 


O 






O 








u 










< 














•r-< 


o 














< 


o 
















o 








L- 






c 


<U 
















o 






<U 








U 


0) 






C 








0) 


-M 






3 






* 






• 


0) 


O 




ft 


lO 














rn 


T3 


-o 


m 


«i 


C 


c 


-j 


o 




<U 


o 


i 






i 






0) 


Csl 




J* 


x: 


-J 




u 




o 


o 






» 


t 




M- 






to 


O 


1 








o 






o; 




o 




c 






»- 


o 







o 



ft 
ft 



OC O 
O I 
I - 
(SI l\J 

cc cc 
o o 
t i 
m - 
cc m 



WO 96/17956 



PCTAJS95/1S944 



18/29 




W 96/17956 



19/29 



PCTAJS95/ 15944 




WO 96/17956 



20/29 



PCTAJS95/15944 



TBA: TARGET BINDING ASSEMBLY 




FIGURE 9 

tt im' i miiL gunr /mhc m\ 



W 96/17956 



21/29 



PCT/US95/15944 





HIV-DETECT I 



HIV-DETECT II 





HIV-DETECT 



HIV-DETECT IV 



FIGURE 10 



m m » ■ 



WO 96/17956 



PCT/US9S/1S944 




HPV-DETECT III 



22/29 




HPV-DETECT IV 



FIGURE 1 1 



WO 96/17956 



23/29 



PCT/US95/1S944 




FIGURE 1 2A 



W 96/17956 



24/29 



PCT/US95/15944 



PNA-TNA HYBRID AND 
OTHER SAMPLE DNA 



+ TBA 



+ BNA 




I ■■ ■■ 



TIT 



■ ■ ■ II 



PNA-CNA 
HYBRID 

wrm 

BBR-BBA 
COMPLEX 
ATTACHED 
TO PNA. 




PNA-TNA HYBRID 
WITH BOUND TBA AND 
BBR-BBA COMPLEX 
ATTACHED TO PNA 



FIGURE 12B 



W 96/17956 



PCT/US95/I5944 



25/29 



(IA) 



A 

Y/S/S////S///JI 



B 



C 



A C 



(IB) 



(II) 



B PNA 



B TBR 



(IIIA) Y/y/y///////A 



A C 

yyyyyyyy///y/yjv!^tst^^ (IIIB) 



(IV) 



AC PNA 



B TBR 



(VA) 



'/////////////A 
'A 



AC TBR 



(VB) 



FIGURE 13 



WO 96/17956 



PCT/US9S/1S944 



26/29 




BINDS ONE OR MORE BINDS ONE OR MORE 

NFKB BINDING SITES SP1 BINDING SITES 




BINDS ONE OR MORE NFKB BINDS ONE OR MORE NFKB 

AND SP1 BINDING SITES AND SP1 BINDING SITES 



FIGURE 1 4 



WO 96/17956 



27/29 



PCT/US95/1S944 





BINDS ONE OR MORE E2 BINDS E2 AND ONE OR MORE 

AND TATA BINDING SITES TATA BINDING SITES 



FIGURE 1 5 



WO 96/17956 



28/29 



PCT/US95/15944 



NON-THERAPEUTIC THERAPEUTIC 
COUSIN SITE COUSIN SITE 




0g(]> (DBY) 

TARGET BINDING 
VcS-^ ASSEMBLY (DBX'-DBY) 



(IIIA) 



COUSIN 
TBR 



TARGET 
TBR, 



TARGET nnm 
TBR,- (IIIB) 





FIGURE 1 6 



WO 96/17956 



29/29 



PCT/US95/15944 




u 

LU 

-J 
u 

ZD 



o 
< 



LU 





substitute shht (rule J*) 



